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PREFACE 


The plan for Statistics in Psychological Research was developed, in outline 
and detail, over a six-year period, during which the author was a member of 
the faculty of the Department of Psychology at Pennsylvania State University. 
In twelve semesters, nearly five hundred advanced undergraduate students 
and graduate students were enrolled in a statistics course which was taught by 
the author and which served as the testing ground for the methods and 
materials of the text. The students were primarily majors in psychology, but 
students from education, speech, home economics, civil engineering, physical 
education, mathematics, and mineralogy constituted a substantial proportion 
of the enrollment. In addition, numerous members of the faculty from 
various departments of the University audited the course. 

The text is intended for a one-semester course in psychological statistics. 
It may be used as an introductory text for advanced undergraduates and 
graduate students. It may also be used as a second- or intermediate-level text 
for psychology majors in general. It covers the standard topics, but concen- 
trates on critical issues rather than some minor or peripheral matters. 
Emphasis is given to the distinction between descriptive and inferential 
statistics, between experimental and non-experimental research, and between 
testing significance and estimating parameters. 

Considerable space is devoted to developing the concept of psychological 
statistics as an important part of the language of the research psychologist. 
To assist the reader in developing this concept, chapters are identified as 
referring to syntactics, semantics, or pragmatics. Thus an attempt has been 
made throughout the text to Organize the discussion of statistical issues 
around certain fundamental notions about the study of language. This 
language approach to the study of psychological statistics assists the student 
in recognizing the nature of a given issue and its resolution without introduc- 
ing any additional assumptions into the discussion or placing any restrictions 


on the standard treatment of the subject. 


vi 


Dr. Kendon R. Smith and Dr. Lewis R. Aiken, Jr., presently my colleagues 
at The Woman's College of the University of North Carolina, made many 
helpful criticisms and suggestions. 

I am indebted to the authors and to the publishers, Oliver & Boyd Ltd.. 
Edinburgh, for permission to reprint Table No. ILI from Statistical Methods 
for Research Workers by Professor Sir Ronald A. Fisher. and Table No. IH 
from Statistical Tables for Biological, Agricultural, and Medical Research by 
Professor Sir Ronald A. Fisher and Dr. Frank Yates. 

Finally, I must give credit to my students whose expressions of interest 


and enthusiasm over the past six years were a constant source of encourage- 
ment in bringing this project to completion. 


W. S. R. 
Greensboro, North Carolina 
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STATISTICS IN PSYCHOLOGICAL RESEARCH 


LANGUAGE, SCIENCE, 
AND STATISTICS 


E 


We shall begin with a definition. 
A language is a system of interrelate 


à social group for purposes of communication. 
In the totality of human language, one can discern numerous overlapping 


systems and subsystems which vary considerably in size and complexity, and 
which are combined in various Ways to suit the purposes of their users. The 
familiar, extensive sign systems of everyday life, such as English, French, 
German, and Russian, are often called natural languages to distinguish them 
from certain specialized and restricted. languages. One specialized and 
restricted system of signs which is of interest to us here is what might be 


called the language of empirical science. The language of empirical science 
onsisting of technical language 


can be described, in general terms, aS © 
of various sorts along With some parts of one or more natural 


languages. 

Progress in empirical science, as the modern world has known it, would 
have been impossible without language. TO achieve new additions to one's 
knowledge of natural phenomena, observation, a basic component of em- 

hich can be accumulated, 


pirical research, must be converted to data W i 1 
analyzed, and reported. Language is necessary In recording and storing data, 


in organizing and systematizing data, in describing and predicting natural 
Processes and events represented by the observations, and in generalizing and 


communicating the final res 
, Although mathematics is 4 
important part of the more complex 
Weise Heer is a system O14 emarkably adapted 
i mined. A 
ede completely aes and the transformation of these 


to the formation of relations among signs "Mr 
relations. As a vehicle for logic and deduction it 1$ unequaled by any other 
1 


d signs employed by the members of 


ults. . : 
it also constitutes an 


m of empirical science. 
terrelations are unam- 


in itself, 
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language system. In mathematics very complex and extended operations upon 
certain initial sets of signs will yield numerous other sets without error or 
controversy. It is, finally, the language in which the scientist sets forth his 
propositions with the greatest precision. 

Mathematical statistics, a branch of mathematics, is also a language. (The 
suffix “ -ics " of the word “ statistics,” as the term is employed at this point, 
indicates a branch of systematized knowledge and the term is construed as 
singular in number.) Mathematical statistics is a system of interrelated signs 
which includes certain special symbols in addition to the common symbols 
of arithmetic, algebra, geometry, calculus, and probability theory. It shares 
in the deductive perfection of mathematics. 

Mathematical statistics has developed rapidly and extensively in the 
present century. It has become an important part of the language of the 
biological and behavioral sciences. The significance of this development is 
indicated by the fact that the formal design and analysis of research in these 
Sciences are now largely guided and determined by statistical theory. 


Psychology and education share this concern with and dependence on 
statistics. 


The Study of Language 


There are numerous ways of analyzing and describing language, but the 
approach which seems particularly appropriate and valuable to the present 
discussion is that of Rudolf Carnap and Charles W. Morris, whose scholarly 
contributions in this area are widely known. (Selected writings by these 
authors are listed at the end of Chapter 1 as suggested readings.) We wish 


to indicate that the discussion which follows is based to a considerable 
extent, but not entirely, 


brief and introductory 
pretense of being high! 


referent. 


m a ihe Pea of language. Signs may be letters, numbers, words, or 
uen as "9, qu and *€ als Signs may be combined, arranged 


and composed into Sentences. 
is an object,! a property of an object, a 
; a function involving relations, or a complex 


1 E RS RE 
When we say Object," we intend “ object or event." 
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situation involving objects, properties, relations, and functions.? What is 
referred to is said to be designated by the sign and, in consequence, is called 
the designatum of the sign. 

In responding to a sign, an interpreter takes account of the designatum. 
Any attempt at a complete explanation or understanding of the response to à 
sign would have to include, as possible factors, the personality of the inter- 
preter and the nature of his group. Making the response of an interpreter a 
point of reference for the analysis of language has the effect of assigning a 
place of central importance in that analysis to the interpreter's personal 
characteristics and the social context within which he operates. 

A comprehensive investigation of a language would deal with signs and 
their interrelations; the designation of objects, properties, relations, functions, 
and situations by signs, expressions, and sentences; and the interpreter of the 
signs with the totality of his personal and social behavior associated with the 
use of the signs. Studies of language can, of course, vary in their comprehen- 
siveness. The terms “ syntactics," “ semantics," and " pragmatics " are em- 
ployed to distinguish three levels of comprehensiveness in language analyses. 
Syntactics is the study of language in which only the signs and their inter- 
relations are abstracted and analyzed. Semantics refers to language analysis 
in which signs and their designata are abstracted and investigated without 
reference to the interpreter or user. Pragmatics is the language study in which 
extensive reference is made to the interpreter or user of the language. 

Taking the sign, the designatum, and the response of the interpreter as 
three points of reference, we can characterize the three kinds or levels of 

imple relations. In syntactics the abstracted relations 
language study by three simp In semantics the relation between the sign and 
of signs to signs are analyzed. ns dy. In pragmatics the relation of 
its designatum is the main object of study. 


principal interest is that between sign and user. 

Syntactics has to do with what one woul 
fashioning of letters, spelling, constructing sentences, 
In mathematics, formulas and equations are oer pene 
are examples of deductive reasoning. Calculation 


i i ical expressions. . í 
fits has three sets of syntactical rules: rules which 
well-develop 


igns; ich indicate how signs 
i introducing new signs; rules which ind. l g 
aaier acy e d composed into sentences; and rules 


can be combined, place S GE from one or more other 
y i a sentence ; 
which guide deduction, the obtaining * hese three kinds of 


ences conform to t 
sentences. When expr 


d ordinarily think of as the 
and reasoning logically. 
; proofs and derivations 
pecial form of 


essions OF senti i 
d child, the behavioral event might be 
f interest to a psychologist 
erties might be shown by 


his reading of several pages of test Bene n d 
: es d of reading. ^^. d of another. A function might be 
might be the child's spe? d of one child with reed from differences in spelling accuracy 


comparing the reading spect 
established for predicting di 
for a large number of children. 


2For example, the object mig! 
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m 
syntactical rules, they are considered to be correct. bea nens a confor 
in some respect, they are considered to be incorrect in tha E P s — 

Semantics is concerned primarily with Saletan. sod dene 
designation can be distinguished. The lowest level is the a ahi to the 
an object and a sign which simply directs attention to or p 

ject. : -— 
E a higher level, properties of objects, relations between ier bert i 
functions are designated by signs or combinations of signs. cog 
designate relations, functions, or situations are called wrist -— 

At the lowest level of designation the assignment of signs to ere ond 
often arbitrary. These semantical definitions pose no special problem ie 
at the same time, add nothing to empirical knowledge. hine pend 
designations may be arbitrary, expressions and propositions or Pra 
porate them may be true or false.4 Qualities of truth and falsity can e ud 
mined only by experience and Observation. Success in acquiring emp 
knowledge is largely a matter of formulating true propositions. (Later we 
shall prefer using ** confirmable," a relative term, in place of " true," an 
absolute term.) "— 
At the higher and more complex levels of designation, the meaning 
assignment of signs, expressions, and sentences to designata such 9» pros 
perties, relations, functions, and situations is seldom, if ever, arbitrary.’ The 
designation presupposes some kind of correspondence or isomorphism 
between signs and their designata. Verification of the correspondence is again 
only the yield of experience and observation. In particular we wish to 


emphasize that the development of an elaborate syntax, which may be 
entirely correct within itself, has no necessary consequence for semantical 
verification. 


There are other special signs which connect or indicate relations between 
igns but do not themselves designate objects, properties, relations, or 
functions. Examples of these special signs are the English words: “ all, 


and,” "any," « are,” “ every,” * if" “is” and * not." There are still 
other signs which indicate 


the reaction of the user to the situation being 
described. In the following sentence, ** unfortunately " indicates the reaction 
of the author of the sentence, “ Unfortunately, everyone agrees that statistics 


Given three distinguishable Objects and the three | 


3i 

labels could be assigned as names for the Objects in an 

labels were then used consistently in later communications. 
‘The words, “ farmers," * milk," and * cows,’ 

arbitrarily, but in our everyday experience only 

Proposition. 


abels—RIL, ZUP, and NEM, the 
y arbitrary fashion as long as these 
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is misused, but few can agree on what specific practices actually constitute 
misuse." We shall not attempt here a discussion of the designata of these 
special signs which either connect other signs or indicate the user's reaction. 
Our reason is that the discussion would go beyond the reasonable scope of 
these brief, introductory considerations and would not be especially profitable 
for the student of psychological statistics. 

We have already stated that success in acquiring empirical knowledge is 
largely a matter of formulating true semantical relations but that verification 
itself depends on experience and not on the logical manipulation of signs. What 
is the nature of the extremely important and critical process of verification? 

The question does not have a simple answer. We can say, in general, that 
one must observe the semantical rules of the language in question. A 
semantical rule is one which determines the conditions under which a sign 
or an expression is applicable to a designatum. In other words, a semantical 
rule provides the means of establishing correspondence between signs and 
what they refer to. The problem of correspondence in the case of attention- 
directing signs is not difficult. The semantical rule is that the sign designates 
what is pointed to. Much more difficult is the problem of correspondence 
for signs, expressions, or sentences which designate properties, relations, 
functions, or situations. . ] 

In the development of logical empiricism, r4 VIEWS: OF the prés 
verifying propositions have been prominent. One view is that scienti c 
statements are verified by means of immediate experienses. That S tO GAY, 
propositions can be reduced to so-called primitive sentences os Ppa 
there can be no question. A second view Is that a See chec E only 
by incorporating it in a system with other propan peur se aay 
Propositions have been called protocol sentences and are de ned as ^ 
statements of observations as typically employed by natural scientists. 
third view is that the base of all empirical knowledge is the ultimate B 
of all propositions to attention-directing and pointing, a process calle 


substantiation. 

Carnap has tried to synth 
of distinguishing between the terms 
to propositions. Carnap maintaine 


esize these three views. He stressed the importance 
; * truth " and “ confirmation " as applied 
d that truth is an absolute concept and 


; ich vary with the progress of 
that confirmation is a relative one, degrees of whic eds 
any mue p also distinguished between directly jeg € 
Propositions. A directly testable propose" ; din nfirmed. An indirectl 
Observation, can be regarded as confirmed * = cut € am 
testable proposition is one which t$ tates by ox diceris spin 
tions which are related to it. The testing operations ! sigan assit dm 
position against observation and checking a prop! 


d. 
propositions which have already been confirme 


"See Joergensen (6). 
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Pragmatics, the study of the relation between signs and the user of signs, 
includes and depends on syntactics and semantics but goes beyond them. A 
language cannot be adequately described, nor can its functions be completely 
explained, on the basis of its syntax and its semantical relations. Language as 
behavior is intimately involved with the attitudes, values, and judgments of 
the users. Usages may develop because of their convenience for the user, 


their effectiveness in communication, or their demonstrated practical 
benefits, 


Common-sense considerations enter pragmatics in the supposedly sound 
and valid judgments underlying the thinking, reasoning, and argument which 
play very important roles in the practical affairs of men. It is certainly true 
that some of what passes for common sense is not sound and valid. It is 
possible, too, that part or all of what is actually sound and valid judgment 
could be shown to be so by syntactical analysis with respect to the deductive 
Processes of the language. Although common sense conclusions might, in 
theory, be reducible to formal deduction, it is quite impractical to do so as a 
general thing. In everyday life there is insufficient time to reduce all of one’s 
judgments to the syntactical level of evaluation. Furthermore, only a few 
highly specialized scholars could do the logical analyses in those instances in 
which they are possible. Most of us believe that many important problems 
can be solved and their solutions communicated satisfactorily in language by 
an application of sound judgment on the part of one or more individuals 
without recourse to a formal analysis; and we act accordingly. Consequently, 
it seems appropriate to study, in pragmatics, the important role of common 
Sense as it is exercised by the user of the language. 


Convention and habit are also important factors in language behavior. At 
any given point in time in any social group, certain ways of speaking and 
writing are taken for granted and used uncritically. If a member of the group 
does question a specific construction or idiom, he is often at a loss to explain 
It or account for it except as a convention. Even so, it would not be justifiable 
to conclude that conventional and habitual usages have completely arbitrary 


origins. They may have originated in the attitudes, values, and judgments of 
members of the social group of an earlier period. 


seis tpa] to the sentence of syntactics and the proposition of 
mantics is the expectation of pragmatics. An expectation is the response 


of an interpreter to a Proposition initiated by himself or another 


— Expectations are fulfilled or not fulfilled in the experience of the 
er. 


Psychological Statistics 


Psychological Statistics 


Psychological statistics, as we wish to consider it here, is a language 
consisting of mathematics, psychological terms and other technical English 
words, and words employed in formal English for speaking and writing. It 
is a language employed by psychologists in research, the objective of which is 
ge about the behavior of organisms.’ The signs of 
ude numbers, the letters of the Latin and Greek 


alphabets, mathematical symbols, and words. The designata of psychological 
statistics are behavioral events or products, their properties and relations, 
taken singly and in groups produced by sampling. The interpreter in psycho- 
logical statistics is the research psychologist as he responds to and makes use 
of the signs. - 

Our problem is to give an account of a specialized language by means of a 
natural language. The specialized language 1$ psychological statistics. The 
natural language is English. We must employ English to introduce and 
describe psychological statistics. The fact that the two languages overlap 
somewhat has both advantages and disadvantages. Common features of the 
two will sometimes facilitate our efforts at communication. On the other 
hand, important discriminations in psychological statistics are quic d 
made difficult by the relatively loose usage which characterizes the ordinary, 
natural language. 

We shall not make a great fu 
language and the natural languag 
we hope to proceed from everyday, clas 
kind of process of successive approximation, 
research psychologist. The transition from 
gradual and the dividing line between the two 
take it for granted that the transition will not 
that much backtracking will be cn PR 

We propose dividing the study of Phi a 
syntactics, semantics, and pragmatics. This P 
and explanation in a number of ape as studies of different dimensions of 

Syntactics, semantics, and DE of abstraction. In no instance does the 
the same language are ken aa perfectly discriminable disciplines. There is 

TO! ion yie > 
p pesar iban ncn overlap and ape spi plas rii 
one doe try very hard to make as AA ] probl th 
s not have to try le, treating as semantical problems those 
the cause for confusion. For example, hemselves linguistic abstractions ds 


x 3 i aaret 
designations for which the designat: ipie 
: ent, P. Ei 
"Behavior is defined here as meaning A e pot private; that is, it is o 
2 i , 
Part of a living organism. Behavior 
Study. 


the acquisition of knowled 
psychological statistics incl 


ss about this distinction between the specialized 
e. It may be sufficient to say simply that 
y, classroom and textbook English, by a 
to the statistical language of the 
the one to the other will be 
will often be imperceptible. We 
be without discontinuities and 


cal statistics into three parts: 
posal deserves amplification 


r a vocalization on the 
pen to objective 
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not a profitable line of activity for our purposes. It is true that signs them- 
selves might be taken as designata, or the syntactical relations among signs 
might be so considered, or states of the user, or pragmatical issues, or 
semantical issues, or properties of properties. But enough of this proliferation 
of abstract designata. Common sense tells us that, if we continued, we could 
soon reach a level of complete absurdity. 

How the study of psychological statistics can be divided into syntactics, 
semantics, and pragmatics is not immediately obvious upon first considera- 
tion of the subject, a circumstance which will very likely provoke in the student 
some skepticism as to the desirability of doing so. Our previous experience 
indicates that, at many points, the distinctions among the three separate 
studies can be made and maintained with ease and confidence on the part of 
the student. At a few points the distinctions may be difficult and may appear 
to be arbitrary. The advantage which derives from the many relatively clear 
distinctions far outweighs the disadvantage of the few unclear ones. 

What is the advantage of dividing psychological statistics into three 
separate studies? We can answer this question by considering the kinds of 
problems encountered in the general study of the subject and how these 
problems can be grouped under the three headings: syntactics, semantics, and 
pragmatics. We shall see that these three groups of problems possess clearly 
distinguishable features and require for their solutions quite different 
approaches. The important advantage of the analysis, then, is that it brings 
into sharp focus the different kinds of problems and the peculiar nature of 
the solution appropriate to each kind of problem. 


EXAMPLES OF SYNTACTICAL PROBLEMS 


. Recall that syntactics is the study of the relations of signs to signs. Although 
in psychological statistics we have problems of English syntax as well as 
mathematical syntax, we shall assume that students possess an acceptable 
level of competence in the former and that it will, therefore, be profitable to 
focus upon problems in the latter. Listed below are examples of problems 
which belong in syntactics. In these examples no attempt has been made to 
stay within the prior experience of the student, a circumstance which should 
not alarm him for issues of this kind will be dealt with in later chapters as 
it becomes necessary and appropriate to do so. 


1. Perform the arithmetic operations indicated below. 


42 x 21 +31 x 56 + 29 x 38 
2. Given the equations, 


By + Borie = lia, 
Birnie + Ba = F23, 


solve for B, and f, in terms of r,,, ris and rss. 
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3. Transform the equation, 
Y' = b,,X + (Y — by. X), 


by translating the axes so that the origin is at the point, (X, Y). 
4. Find the value of M, which minimizes the value of the expression 


5 (X: — M 
i=1 


5. Given a discrete population distribution for the variable X, find the 
mean of the sampling distribution of means of random samples of two values 
when the sampling involves replacement. 

The five problems given above are entirely devoid of empirical content. 
One has only to know the rules for handling the symbols and the solutions 
can be readily obtained. 

No doubt some of the five syntactical problems given above will be 
difficult for individuals who have a limited background in mathematics. In 
the face of such difficulties, beginning students in psychological statistics 
often imagine that they would be helped if they were given examples of 
applications. The fact is that applications almost always introduce new issues 
and complexities without resolving any of the original ones. Students who 
have difficulty with any of these problems will only overcome that difficulty 
by applying themselves diligently and with concentration to the syntactical 
rules involved in the solutions. It is possible, of course, that they need a 


review of mathematics or further training in that subject. 


EXAMPLES OF SEMANTICAL PROBLEMS 


Consider next the area of semantics in psychological statistics. Recall that 
semantics is the study of sign-object relations in terms of the confirmability 
of the designations. Again we must acknowledge that the nontechnical verbal 
portions of statistical language give rise to semantical problems riis in the 
case of syntactics, We shall assume that students use everyday English iie 
reasonable regard for the semantical rules and that the important roa iere 
is sign-designatum correspondence in two areas: measurement an statistica 


inference. 


While a number of different levels of sign-designatum relations can be 


conceptualized, we shall limit our consideration here to one of I 
In our discussion, what is designated is not itself a "€ a com : “aie 
signs, or à property of signs. The designatum is, seer ys 2 y: a , 
biological psychological, or social attribute cat ine ivi a toe mice 
That is to say, what is designated is some ter of behavior guage 
or any part of language, ion pras x id ds of designations can be 

For the behavioral scientist, two important ` in dh recie ^ 
distinguished. First there are the designations Imp ied by , the 
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process of quantifying observation. Psychological measurement is the designa- 
tion of the properties of behavioral events by means of numbers. Second there 
is the designation of the error variability in measurements. For example, 
specification of the sampling variability in measures of behavioral events is 
the designation of an empirical sampling distribution by means of a mathe- 
matical function. 

These two kinds of semantical relations, measurement and the specification 
of error variability, should be preeminent in the study of psychological 
statistics. because the confirmability of these designations is of critical 
importance for the ultimate use of statistics in research. The behavioral 
scientist's chances of success in acquiring knowledge through his research are 
increased considerably if the research is carried out under conditions of 
confirmed measurement and confirmable Specification of error variation. One 
could, of course, extend the notion of confirmability to the final results of 
research. Since an account of the use of statistics and the interpretation of 
results in research involves to a considerable extent comment on the users, 
we shall not treat the problems of confirming research results as semantical 
problems. Instead, we shall limit our concern with confirmability in semantics 
to the two intermediate issues: measurement and the specification of error 
variation, 


Listed below are five problems in psychological measurement. 

1. A student Participates in the javelin throw in the track and field events 
at his school. His best throw under the standard conditions for the event is 
a distance of 35.45 meters, Opposite his name in the record of the meet there 
1S written, as a measure of his performance on that occasion, “ 35.45 m." 
g numbers to behavioral events be justified ? 
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scales. The four scales are intended to measure important dimensions of the 
applicants’ behavior. A scale is represented by a horizontal line on which are 
located five, equally spaced points, numbered “ one " to “ five " from left to 
right. Each point is also labeled with one or two words intended to assist the 
rater in making his judgment. The twelve ratings from four scales and three 
judges are summed for each applicant. The sum constitutes the applicant's 
combined rating. Does this sum qualify as a measure of the characteristics 
of an applicant, as intended by the interviewers? 

5. A clinical psychologist obtains 100 items, each item consisting ofa 
statement which may be used by a mother in describing her behavior with 
respect to an infant son. He instructs a client to order these items in terms of 
the way they describe her relations with her son, the order running from those 
** most like " herself to those “ least like ” herself. He then asks her to order 
the items a second time as an ideal mother would have performed the task. 
The two orderings are then correlated. The product-moment coefficient of 
correlation for this particular client is 0.55. The psychologist reports the 
coefficient as a measure of agreement between the client's self-concept and 
her ideal self-concept. Is this measure of agreement confirmable ? 

The five semantical problems posed above all have to do with the observa- 
tion of behavior and ways of quantifying observation. In each example 
numbers are assigned to behavioral events. The purpose of the assignment is 
to characterize and record properties of the events. One certainly cannot 
assume that the assignment of numbers by any arbitrary system whatever 
will automatically achieve the goal of characterizing the properties of the 
events, Furthermore, there is nothing in mathematics or any other language 
which makes it possible to deduce evidence of confirmation or disconfirma- 
tion. Confirmation can only be achieved by experience. 

A comprehensive treatment of psychological measurement = go far 
outside the scope of a single text and would overlap the d ere 
courses in the conventional course offerings ofa rre o acis ho Dg 
What we shall attempt in this book is a brief outline of the Ed pes i 
measurement. This discussion of the objective cardi it whe A r 
degree of correspondence between properties of numbers anc prc 4 ssi 

à d by the numbers can be determined will be given in Chapter 
vri ceca d present, we shall say only that these conditions have to 
ehe ibili d predictiveness of the assigned numbers. 
do with the reproducibility and P afl eres. There 
ished with our examples of semantical problems. " 

We have not yet finishe i ing the specification of error variability, of 

is the other important class involving p 


which three examples are given below. 

1. A student has 10 coins. Thet a 
identified as ‘ head " and “ tail." He eem 
records the number of heads. He repea 


rd 
until he has 1,200 counts. He notes and reco 


o sides of each coin can be readily 
s the 10 coins, then counts and 
ng and counting procedure 
s the number of times he 
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counted zero heads, one head, two heads, and so on. He then converts these 
frequencies to proportions of the total number of counts. 
He is then told by his teacher that the terms in the expansion of the 
binomial, 
(p + 9)", 


when p = q = 0.50, will approximate the proportions he obtained. 
What justification is there for this prediction? 


2. A machinist's apprentice is tested for his accuracy and precision in 
measuring the diameter of a standard metal rod using a micrometer. He is 
required to make a lengthy series of measurements in which the standard is 
alternated with other rods whose diameters vary somewhat but not enough 
to be detected by a visual inspection. His results for the standard rod are 
summarized in a frequency distribution. It is observed that this distribution 
is roughly symmetrical around a central value with relatively few values at 
either extreme and a relatively large concentration of values in and near the 
middle. A statistician sees the data and tries to fit a normal curve to them. 
The fit is considered quite good. He makes note of the fact that the apprentice's 
variation in performance is “ normally distributed." Is this correspondence 


an accident or is it evidence of the operation of some general law of 
behavior? 


Tequency distribution of the 1,000 means is constructed, 
| icts in advance that the form of the distribution of means 


ely that of a normal 
outcome will be? curve. How does he know what the 


The semantical problem of s 
will be discussed at some length i 


probably suggests 
though this inter- 
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meaningful notions if we did not at once caution the student regarding this 
possibility. 

The danger is that a student, when he comes to pragmatics, may think that 
all he has to do is memorize a list of statistical methods and corresponding 
applications. At a very low level of sophistication and understanding his task 
might be so construed. But there is much more than imitation in employing 
statistics in research and consequently much more than memorizing in 


pragmatics. 

Let us illustrate with three issues. 

1. Summarization of data is one application of psychological statistics. 
One might look upon this summarizing process as a condensing of the 
original data to a form which makes them more convenient to handle but 
which lacks the original detail. There are, of course, degrees of condensation. 
One could reduce a mass of data—let us say 10,000 observations—to a single 
or one hundred, or whatever number one 
chose. And what does one choose? What degree of condensation is desirable? 
How much detail can one afford to lose? The answers to these questions will 
depend on a complex of considerations arising out of the practical situation 
and relating to current practices among research psychologists. 

2. A common problem for a research psychologist is the choice of an 
equation to express a relation between two variables. Equations vary from 
simple to complex. Ideally an investigator would want to use the equation 
which was correct or best, in some sense, regardless of complexity. Practically 
he must take account of the inconvenience and expense involved and he will, 
therefore, often accept a simple equation as an approximation and reject an 
intractable, but possibly more exact, one. Unfortunately there is no simple 
rule for comparing a loss in predictiveness with gains in convenience and 
economy. Consequently, there is a fair amount of disagreement among 
psychologists in the choice of approximations of various kinds. Although one 
may justifiably deplore this state of affairs, there is no point in insisting on 
perfection in these matters. It is in the very nature of research that one must 
be willing to employ approximations even when they are subject to somewhat 


arbitrary selection. 


3. In examining the result 
have produced the result. We agree to a 


i i i nn 
will ensure our being misled by error o r 
in the long run. Why do we resign ourselves to being mistaken in our evalua- 
tions? Why do we choose one-in-twenty as the ratio and not some other? 
What uncertainty is revealed in this final projection into the future—“ in the 

ions involve a mixture of common 


»9 The answers to t g " : : 
]ong run d experience. They will be amplified in later discussions. 


sense, convention, an 
A reference was made earlier to the correlatives: the sentence of syntactics, 
the proposition of semantics, and the expectation of pragmatics. We say that 


value, or to two values, or three, 


s of an experiment, we recognize that error may 
standard evaluation procedure which 
o more than one in twenty occasions, 
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a sentence is correct or incorrect, a proposition is confirmed or disconfirmed, 
and an expectation is fulfilled or not fulfilled. What constitutes the fulfilment 
of an expectation? The most direct answer is that, if the researcher employs 
correct syntax and confirmed semantical relations, his immediate expectations 
are fulfilled in the acquisition of empirical knowledge. 

Although the acquiring of knowledge is a prime goal of the researcher, 
there is more to be said about the user of statistics and his expectations, in a 
broader sense. Students are confronted with what they often perceive as a 
burdensome chore in studying psychological statistics. What are the con- 
comitants and products of this learning experience, in addition to the very 
fundamental development of research skills? 

The first important outcome of studying and mastering psychological 
statistics is that one comes to understand research issues and to know that 
one understands them. Understanding and knowing that one understands 
these issues are necessary preconditions for intellectual independence and 
authority, for successful self-direction, and for a personally healthy and 
socially acceptable degree of confidence in one's work. 

The second important outcome is that one comes to be understood by one's 
colleagues and to command agreement and assent, on an intellectual level, in 
the resolving of research issues. 

The third important outcome is that one becomes a better teacher of 


Psychology and psychological research methods in the classroom, the 
laboratory, and the field. 


Looking Ahead 


In anticipation of what lies ahead, let us consid 
features of the plan for dividin 
semantics, and pragmatics. 


er some of the prominent 
g psychological statistics into syntactics, 


The plan does not involve a simple division of the subject matter into three 
parts, to be presented in three successive sections of the book. There is some 
appropriate and necessary alternation of the three approaches. Furthermore, 
they do not receive equal emphasis and elaboration, 

Since the syntax of arithmetic and algebra is very important in psycho- 


logical statistics, exercises and Suggestions for review of these elementary 
mathematical subjects are given in Chapter 2. 


Another syntactical topic, the symbolism of sums, is given a fairly detailed 
treatment in Chapter 3. 


Before we can continue with the outline, we must introduce a few terms. 
It was pointed out earlier that the word “ 


statistics," construed as singular, 
indicates a branch of knowledge. The mea 


ning of the word “ statistic," con- 
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strued as singular, deserves comment. We shall not use the word “ statistic ” 
without a qualifying prefix. Any value computed from a set of measures will 
be called a '* d-statistic " because it is, to some degree, descriptive of the set. 
By definition, a d-statistic does not involve the notion of error and, in a 
formal sense, nothing can be said about its error. The plural form of 
d-statistic is obtained by adding “ s 

Any value computed from a set of measures subject to random variation, 
as in computing from the measures in a sample drawn from a population, 
will be called an “ i-statistic " because certain inferences may possibly be 
based on that value. An i-statistic has descriptive value with respect to its 
sample; a d-statistic does not have inferential value for a population. The 
plural of i-statistic is obtained by adding “ s." 

While the qualifications represented by the prefixes, * q- " and “i,” are 
widely known and understood, these prefixes are not in general use. The 
distinction between two corresponding branches of systematized knowledge, 
descriptive statistics and inferential statistics, is, however, frequently made by 


statisticians. 


The thirteen chapters from 4 through 16 are classifiable, as follows: 


Chapter Topic 
Syntactics 4 A class of values called moments 
7 A class of values called product-moments 
8 Partial and multiple correlation 
10 The analysis of variability in classifications of 
numbers 
11 Sampling theory 
12 Sampling distributions 
Semantics 5 Measurement 
13 Practical random sampling 
Pragmatics 6 Descriptive uses of moments 
9 Descriptive uses of product-moments 
14 Statistical inference in research 
15 Tests of significance in experimental research 
16 Tests of significance and estimates of parameters 


in non-experimental research 
actical and pragmatical issues relating to the 
application of statistical inference to frequency classifications. 

Note that two chapters, 6 and 9, are devoted to descriptive statistics, while 


four chapters, 14 through 17, deal with inferential statistics. 

Because we believe that learning is facilitated if the student keeps in mind 
the division of the subject into syntactics, semantics, and pragmatics, each 
chapter Or section devoted solely to one of the three kinds of study is so 


designated by an appropriate sub-heading. 


Chapter 17 deals with synt 
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ARITHMETIG 
AND ALGEBRA 


Syntactics | 2 


d in statistics by their failure to recall, 


Students are sometimes handicappe 
very elementary principles or rules in 


or to grasp the significance of, certain 
arithmetic and algebra. It is our purpose here to provide a brief review of 


arithmetic and algebra with focus upon those principles and rules which have 
the most direct bearing on the work to be undertaken later. It is not our 
intention to present a course in arithmetic and algebra. Students who have 
had no previous training in algebra should either set out upon a course of 
self-directed study using a standard text or enroll in a first course in college 


algebra. 
Exercises will be provided at appropriate points in the review. The student 
should make full use of these exercises for the evaluation of his progress and 


the diagnosis of his difficulties. Answers will be given to the exercises bearing 
odd numbers but not to those bearing even numbers. 


On Arithmetic Operations 


Addition is indicated by the sign,“ + “A 
8+9 


Subtraction is indicated by the sign, “ — 
7—5 

1One al i dents might find helpful is J. B. Rosenbach, E. A. Whitman, 

B. E. meme M Se hitman, College Algebra (Boston: Ginn, 1958). A book in 

T EIE dents have done their reviewing of the funda- 


Which man d education stu » 
mentals ui de re tag Helen M. Walker, Mathematics Essential for Elementary 
Holt, 1951). 


Statistics, rev. ed. (New York: 
17 
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Multiplication can be indicated in three different ways. 
5 x 3or 5(3) or 5:3 
Division can be indicated by either a slant, “ / ", or a bar, * — ". 
9 
9/2 or j 


The order in which numbers are added does not affect the sum. For 
example, the summation, 
3--4--84 6, 
is usually performed in the following steps which, under ordinary circum- 
stances, would not be written out: 


3 + 4 is equal to 7, 
7 + 8 is equal to 15, 
15 + 6 is equal to 21. 


Note that the numbers were accumulated from left to right. Arranging the 
Same numbers in a different order, we have 


8406-3 

Accumulating, as before, from left to right, we take the following steps: 
8 + 4 is equal to 12; 
12 + 6 is equal to 18, 
18 + 3 is equal to 21 


Which is the same sum obtained from the original order, Any other order of 
accumulating the same numbers would give the same answer. 

Grouping of numbers in additi 
parentheses to indicate arbi 
the final summation takes 


on does not affect the sum. If we use 
trary groups whose sums are to be obtained before 
place, we can write: 


8--11 1645 
as 


(8 + 11) + (16 + 5) 
which is equal to 


19 + 21 
which is equal to 


40. 


We can write the Same summation as: 


(8 + 16) + (11 +5) 
which is equal to 


24 + 16 
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which again is equal to 
40. 
Thé order in which numbers are multiplied does not affect the product. 
Given the problem, 
5x8x3x4, 
we can multiply from left to right. 
5 x 8 is equal to 40, 
40 x 3 is equal to 120, 


120 x 4 is equal to 480. 


These same numbers can be written and multiplied in a different order, as 


below, 
3x8x4x5 
but the final product is the same. 
3 x 8 is equal to 24, 


24 x 4 is equal to 96, 


96 x 5 is equal to 480. 


n multiplication does not affect the final product. 


Grouping of numbers i i f 
heses to indicate the operations which are to be 


Again we can use parent 
performed first. 


3x8x 10x 6 
is equal to 
Q x 8) x (10 x 9 
is equal to 
24 x 60 
is equal to 
1,440 


which is the same result we would get if the numbers were grouped as 


G x 10) x (8 x ©) 


which is equal to 
30 x 48 


is equal to - 


ccur in the same expression, it is 
is performed first. In the following 
fore the addition: 


When addition and multiplication o 
necessary to have rules regarding which 
sum of products, the multiplications are performed be 


3x548x6-47x7 
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is equal to 


15 + 48 + 49 
is equal to 


112. 
With respect to the expression, 
5(9 + 7), 


which contains addition and multiplication, there are two correct solutions. 


In the first solution the addition within the parentheses is performed before 
the multiplication, That is, 


5(9 + 7) 
is equal to 
5(16) 
is equal to 
80. 


In the second solution each number within the parentheses is multiplied by 


five and the products are then added. This procedure is an example of 
distribution of multiplication. Thus 


5(9 + 7) 
is equal to 
5x9+5x7 
is equal to 
45 + 35 
is equal to 
80. 


There are also two correct solutions for the problem, 
(2 + 8)(6 + 5) 
In the first solution the additions within parentheses are performed before 
multiplication, Thus 


(2 + 8(6 + 5) 
is equal to 


(10)(11) 
is equal to 
110. 
In the second solution multiplication js distributed and then addition is 
performed. That is, 


(2 + 8)(6 + 5) 
is equal to 


?X62x5485618»5 
is equal to 


12 + 10 + 48 + 40 


42 


Signed Numbers 2 


is equal to 
110, 

which is the answer obtained in the first solution. 

The order and grouping rules for addition apply also to subtraction and 
to combinations of both operations. 

The order, grouping, and distribution rules for multiplication apply to 
division when the division is expressed as equivalent multiplication by 
employing the reciprocal of the divisor. (The reciprocal of a number is “ one ” 


divided by the number.) 
Examples of division expressed as equivalent multiplication: 


3 1 
23 £13 
ais equal to 1 (3), 


1 
er» is equal to 3 (4 4-5, 


1 
S T$ is equal to (3177 (3 + 9. 


Caution: The use of the sign, ". to indicate division can yield 


ambiguous expressions where more than one division is to be performed or 
Where both multiplication and division occur. We shall not use the symbol. 
Note that the position of the bar or the slant 1s critical in expressing 


divis; 
IVision, Observe that 


ae : 31 toS x Sisequatto t 
5/6 is equal to 5/6 x 4 is equa 3X4 20 
Whereas 
3 
e 4 TEE anis 
EE is equal to : x x is equal tog X 5 x 7 is equa 9 125 


Signed Numbers 


i i i “4 or“ — ”, precedin 
^ signed number is a number with a sign, ds a rte bar 5 
1 em siti u 2 
^ number preceded by " + 7 ponar 4 ;o called directed numbers 
—"isa ; ber. Signed numbers are also ca 
negative number. 
9r algebrai 
ene i i isregard the signs 
d ot ng withisigaed numb repere et Gk To do 
and consid lled absolute values of the sig' i 
— m with the numbers as if they 


80 we qr : n operations 
o erform OP pem M 
Were p the signs and p value of +9 is 9; the absolute value of —4 


veni ^d 
acr , rare “fs ver K 


Unsigned. Thus the absolute p 
ye w, 
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is 4. The absolute value is also called the arithmetic value or the numerical 
(ad The symbol for absolute value is two vertical, parallel bars written as 


|—6]| , which is equal to 6, and 
[+8 | , which is equal to 8. 

An expression involving numbers and signs, such as 
T3—142—4-—24 6, 


can be interpreted correctly in any one of three ways. The signs can be taken 
as referring to the operations of addition and subtraction involving unsigned 
numbers, or to the operations of addition and subtraction involving all 


positive numbers, as indicated below, 
(+3) — (+1) + (+2) — (44) — (+2) + (+6), 


or to the addition of positive and negative numbers, that is, as 


(+3) + (=D + (42) + (74) + (22) + (+6). 


Rules have been established 


for the addition and subtraction of signed 
numbers. They are listed below. 


1. To add numbers with like 
Sum with the common sign. 
2. To 


with positive Signs, add the absolute v 


signs, add the absolute values and prefix the 


other, change the sign of the 
d and proceed as in addition. 


n established for the multi 
are given below, 
l. The product or quotient of 


number to be subtracte 
Rules have also bee: 


Plication and division of 
signed numbers, They 


numbers are often omitted but und 


In multiplication, when the t 
interpreted as the repeated addi 


€rstood to be in effect, 


WO signs are Positive, the product may be 
tion of a positive number. That is, 

6(3) is equal to 3 33.3 35:3. 3-3; 
When one Sign is positiv 


€ and the other 
interpreted as Tepeated a 


ddition of a negat 
7(—4) is equal to (—4) + ( 


is negative, the product may be 
ive number, Thus: 


79-F C 4 (4) 4 (345 4. (—4 + (—4). 
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w ! x 
eras both signs are negative, the product is defined as positive, thereby 

ntaining consistency in the distribution of multiplication. Consider the 
problem, 

—3(7 — 2): 
Eos should be two correct solutions. In one we subtract within the paren- 
eses before multiplying. Subtraction gives us 


—3(5) which is equal to —15. 


ex the other solution, we distribute the multiplication by —3 over both 
umbers within parentheses, obtaining 


(—3)(7) + (7-372. 


Tie first product has factors with unlike signs and is, therefore, negative. The 
econd product has two factors with negative signs. This second product 
must be given a positive sign to yield a final answer of —15 and be consistent 
with the first solution. That is, 

(—21) + (+6) is equal to —15. 


are justified by expressing the division as 


The sign rules for division 
plying the sign rules for multiplication. 


equivalent multiplication and ap 


Geometric Interpretation of Signed Numbers 


tance on a scale measured in one 


A signed number is interpreted as a dis 
ented below by the horizontal line 


of two directions. Consider the scale represi 

marked off in equal segments. 

wos j ae ee ae T ee: 
=f <6 5-4 3 Sl 0 +1 42 43 +4 +5 46 +7 +8 


Notice there is a point labeled “ 0 ” and other points numbered consecutively 
in each direction, that is, to the left and to the right of the zero point. The 
Point labeled with zero is called the origin. Points to the right of the origin 
are labeled with positive numbers which increase in value as one reads the 
scale from left to right; points to the left of the origin are labeled with 
Negative numbers which decrease in value as one reads from right to left. A 


Signed number expresses a distance in either the positive or the negative 
direction and locates a point on the scale. The correspondence between a 
Signed number, X, and a point on the scale, P, is acknowledged by calling 
the number the coordinate of the point and the point, the graph of the number. 

The length of a segment of the scale between the origin and point P, 


disregarding the direction, corresponds to the absolute value of X. 
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Geometric Interpretation of the Combining 
of Signed Numbers 


An expression containing positive and negative numbers can be referred 


to the scale given above and interpreted as locating a point on the scale. For 
example, the expression, 


T3—132—4--I-4$, 
is interpreted, as follows. . ps . . 
Start at zero on the scale and measure three units in a positive direction, 
then one unit in the negative direction, then two positive, four negative, one 
positive, and finally two positive. Movements along the scale are diagrammed 


below. They stop at the point, +3, which corresponds to the value of the 
expression after all the operations have been carried out. 


Se pop E RE a 
8$ —7—6-5—4—3—2.] 9 +] 


Fractions 


A fundamental principle of importance in working with fractions is that 
multiplying (or dividing) the numerator and the denominator of a fraction 
by the same quantity does not change the value of the fraction. 

Example: 


3, 5(3) 8(3) 
Is equal to —— į ss 
7 1s eq 5(7) is equal to 8(7) 
Caution: Adding or subtr 
the denominator does chan 


In adding and subtracti 
then add the numerators, 


acting the same quan 
ge the value of a fracti 
ng fractions we obtain a 


tity in the numerator and 
on. 


common denominator and 
Example: 
ls 5 6 
3 + 5 iS equal to ist Ge 


is 1 11 
15 18 equa to 15 
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In multiplying fractions we obtain the product of the numerators and the 
product of the denominators, and write the former over the latter. 


Example: 
2 


X = is equal t 3 
5X5 qual to 4; 


In dividing fractions we invert the divisor and multiply. 


Example: 


; l 1 Sia T 5 
is equa to4 X ,1sequa 99 


1/3 
3/5 


Algebraic Symbolism 


oon discover, and are often exas- 
riting symbolism vary considerably 
The brief description of writing 
a universal, invariant standard. 


Students of psychological statistics s 
perated by, the fact that practices in w 
from one author or teacher to another. 


practices given below should not be taken as 
It is well for students to develop, at the very beginning, an attitude of adapta- 


bility and flexibility where symbolism is concerned. One of the chores a 
Statistician takes for granted as part of his lot in life is the constant adjust- 
ment and readjustment of his frame of reference for the interpretation of 


Symbols as he goes from one text or article to another. 
With these cautions a matter of record, we shall proceed to describe 
certain rather general but not necessarily fixed practices in writing symbolism. 


Letters are used, in a variety of ways, to represent numbers. 
Example: Let X represent any number in a set or class 


We use different letters for different sets or classes. 
nt any number in one set; 


of numbers. 


Example: Let X represe let Y represent any 
number in another set. 


ripts are attached to letters to permit identification of 


Numerical subsc 
particular numbers within a set. 
fers to the third number in a set; Y; refers to the seventh 


Examples: X, re : 4 
ment of numerical subscripts may be 


number in another set. The assign 
arbitrary. 


Letter subscripts also are employed for purposes of identification. 


ith number in a set, where i is any integer from 
ber computed, by means of a certain formula, 
refers to a number computed, 


" Examples: X; refers to the 
1” to ** n.” S, refers to a num 
from the numbers in the X set. The symbol, Fev 
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by means of a certain formula, from the numbers in the .Y set and the Y set. 
Identification is also achieved by means of superscripts. 


Example: We might let Y' (read * Y-prime ") represent any number in one 
set and Y represent any number in another related set, as when each value 
of Y' is computed from a corresponding value of Y by means of a stated rule. 


We sometimes need seconds, such as Y" and Y”, as well as primes. 
Capital letters and small letters serve different purposes. 


Example: We might let Y represent any number in one set and x represent 
any number in another set obtained by applying a transformation to each X. 


Small letters from the first part of the alphabet often represent constants 
while letters, capital or small, near the end of the alphabet often represent 
numbers in sets as indicated earlier. 


Examples: We might let a Tepresent a particular number whose value 
remains unspecified and Y Tepresent any number in a set of numbers whose 
values may vary but also remain unspecified. 


On occasion a letter is selected by an author or a teacher because it is the 
first letter of some descriptive word or term in a problem but this practice 
Is not always possible or desirable. 

Thus far we have referred only to letters of the familiar Latin alphabet. 
The demands of our statistical language force us to go to the Greek alphabet 


for additional symbols. The Greek alphabet is given below. Capitals are on 
the left; small letters are on the right. 


Letters Names Letters Names Letters Names 
A, « Alpha Lu lota P, p Rho 

B, B Beta K, « Kappa 2, o Sigma 
Dy Gamma 4, À Lambda T, T Tau 

4, 8 Delta M, p Mu Y, v Upsilon 
E, « Epsilon N, v Nu o, 4 Phi 

Za K Zeta &, é Xi X, x Chi 

H, n Eta O, o Omicron YW, y, Psi 

6, 0 Theta Il, n Pi Q, w Omega 


e to that other symbol. 


> is called an operator, Com- 
mon practic i i i i 
p es and rules in su on are discussed at length in 
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As revealed by the examples given below, the operations of arithmetic can 
be indicated with letters representing numbers. 


Addition: X+Y+Z 
Pp +4 
n+l 
X+a 
Subtraction: x —y 
N-1 
Y—c 
a -b—c—d 
Multiplication: XY (read “ X times Y”) 
aX 
3XY 
a(X 4- Y) 
X/Y (read “ X divided by Y ”) 
alX 
(X + Y/N 
(X —a)/5 


Division: 


in mind that an indicated sum, difference, product, 


It is important to keep 
presents a number which 


Or quotient, no matter how complex it may be, re 
can be employed in further operations. 


Equality and Inequality 


The symbol of equality, =, is read “ is equal to." It is used in stating that 


two expressions represent the same number. 
One kind of equality statement is the identity. A 


3(x — 2) 23x — 6 


n example is given below. 


This statement qualifies as an identity because the equality holds for all 


values of x. 
A second kind of equality statement is the equation, an example of which is 
3X +5 =29. 
hed from an identity because there are values for 


f equality does not hold. 
tement is the formula, which states that the 
s the number represented 


This equation is distinguis 
X for which the statement o 


A third kind of equality sta 
number represented by one symbol is the same à 
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by an expression involving one or more other symbols. The following state- 
ment is an example of a formula: 


Z = (X — M,)/S,. 
The symbols of inequality are “ < ", which is read “ is less than," and 
>”, which is read “ is greater than." An inequality symbol appears in a 
statement that the number represented by one expression is less than (or 


greater than, as the case may be) the number represented by another 
expression. 


Em 


Examples (with interpretations): 


mon m is less than n 
P « 0.05 P is less than five hundredths 
ab à is greater than b 
FS: Figg F is greater than F5 

0.01 — P < 0.05 


0.01 is less than P which is less than 0.05 
The symbol for nonequality, +, is read “ is not equal to." It is employed 


in stating that the number represented by one expression is not the number 
represented by another expression. 


Example: In general, 


XY, XY, A(X, + XY; + Y). 
The following exercises are provided to give opportunity for practice in 
writing algebraic symbolism, 


Exercise 2.1 

Let a represent a Particular number. Write the expression for 
1. twice the number 

. the number minus 5 

the product of the number and itself. 

the number divided by 10 

- the reciprocal of the number 

. 8 divided by the number 

. the number multiplied by its reciprocal 
the quantity, a Plus 5, divided by the quantity, a minus 4 


the quantity, a minus 3, multiplied by the quantity, a plus 8 
the quantity, a divided by 2, multiplied by 9 


O00 no0t0 RUM 


Answers to Exercise 24 


1. 2a Dugas 

3. aa 4. a]10 

5. l/a 6. 8/a 

7. alla) = 1 8. (a45 i 
9. (a — 3a + 8) 10. (a2) sali 
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Exercise 2.2 


Let c represent a particular number. Write the expression for 


1. 
. the number plus 10 

. the number multiplied by itself 
. the ratio of the number to 100 
. one divided by the number 

. the number divided by 10 


SeCmrIAAWAYWN 


the product of the number and 5 


the reciprocal of the number multiplied by twice the number 
the quantity, c minus 3, divided by the reciprocal of c 
the quantity, c plus 3, multiplied by the quantity, c minus 3 


. 20 times the reciprocal of the number 


Exercise 2.3 


Let a represent a particular number and let Xi, 
a set. Also let X; represent any number in the set where i ma 


Xa, Xa, and X, be four numbers in 
y be any integer from 


| "1" to 4 
Write: 


9 20 n RUNM- 


. the sum of the numbers in the set 
. a times any number in the set 


the product of a and the first number 
the product of a and the second number 


. the third number multiplied by a 
. a times the fourth number 


the sum of the products written in (3), (4), (5), and (6) 


. a times the sum of the four numbers 

. the sum of the four numbers divided by 4 

. any number in the set minus a 

. any number in the set divided by a 

. any number in the set multiplied by the reciprocal of a. 


Answers to Exercise 2.3 


- Xia 


Xi +X, Xit Xs 2. aX; 

ax, 4. ax, 

aX, 6. aXi 

. aX, + aX, + aXs + aM 8. aX + Xs + Xs + XQ 
. (Xi + Xa + Xa X9/4 10. X; — a 

p Ner s 12. (1/0) X; 


Exercise 2.4 


, Letcrepresenta particular number 
in a set, Also let Y; be any number in 


and let Yı, Ys, Ys, Ya and Y, be five numbers 
that set. We define i as taking any integral 


Value from ** 1 " to “ 5 " inclusive. 
Write: 


T 
25 


the sum of the numbers in the set 


the sum of the products obtained by multiplying each number in the set by c 
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c times the sum of the numbers 
the sum of the numbers divided by 5 
any number in the set minus the reciprocal of c 
. any number in the set divided by c 
. any number in the set multiplied by the reciprocal of c 
. the sum of the quotients obtained by dividing each number in the set by 5 
. the sum of the products obtained by multiplying each number in the set by 
the reciprocal of 5 
10. any number in the set minus the quantity, 3 times c 
11. any number in the set minus the quotient obtained by dividing the sum of 
the numbers by 5 


12. the sum of the five numbers multiplied by itself. 


00 MO 0 BUD 


Exercise 2.5 


Let a represent a particular number. Let X; and Y, be a pair of numbers, X, and 
Y; be a second pair, X, and Y, a third pair, and X; and Y, a fourth pair. Also 


let X; be the ith number in the X set and Y; be the corresponding number in the 
Y set. 


Write: 
. the product of X-sub-one and Y-sub-one 
- X-sub-two times Y-sub-two 
- Y-sub-three multiplied by X-sub-three 
- the product of the members of the fourth pair of numbers 
. the sum of the four products obtained in (1), (2), (3), and (4) 
. a times each of the four products 
- a times the sum of the four products 
. the sum of X 
9. the sum of Y 
10. the product of the sum of X and the sum of Y 
Compare your answer to (10) with your answer to (5). Are they equal? 
11. the product of the members of the it pair of numbers 
12. a times the product of the members of the ith 


Answers to Exercise 2.5 


900 10v ta f 2 M o— 


pair of numbers. 


l. XY, 2. XY, 
3. AY, 4. X Y, 
AGRYCRXG oe XY. XQ. 6 ax, 
aX,Y, 
aX;Y, 
ax, Y. 
s Ed AY OY. XQY) LAHNA X4 X. 
-ft Y 4 Y 4 Y, 10. (X, + X, + X, + X) 
m. (OY; + Y; + Ya + Y);No 
11. X,Y; 12. aX,Y, 2 3 a) 


Exercise 2.6 


Let c be a particular numb 
er. Let X; be the jth number i ; 
let Y; be the Corresponding number in another set MSN DIVERS 


Exponents ai 


Write: 
the product of the numbers in each pair 
the sum of the five products obtained in (1) 
c times each of the five products obtained in (1) 
the sum of the five products obtained in (3) 
c times the sum obtained in (2) 
Compare your answer to (5) with your answer to (4). Are they equal? 
6. the sum of X 
7. the sum of Y 
8. the sum of X divided by the sum of Y 
9. the ratio of X to Y for each of the five pairs 
10. the sum of the five quotients obtained in (9) 
Compare your answer to (10) with your answer to (8). Are they equal? 
11. the sum of the five products obtained in (1), divided by 5 
12. the difference between the members of each of the five pairs 
13. the sum of the five differences obtained in (12) 
14. the sum of the five differences obtained in (12), 


vee 


divided by 5 


Exponents 


d together, each is called a factor of the 
product. When more than two quantities are multiplied together, each one 
or the product of any number of them is also called a factor. 

al number symbol written as a superscript 
duct of identical factors. The exponent 
s in the product, where each factor 


When two quantities are multiplie 


An exponent is a number or a liter: 
to a given quantity to express a pro 
indicates the number of identical factor: 
is the given quantity. That is, by definition, 

(one factor X) 


xi=X 

x2 =XX (two factors X) 
ys = XXX (threes factor X) 
ym = XXX D (m factors X) 


xn Z XXX X (n factors X) 
7; X? is read “ X to the second power " or 
* Y to the third power” or *" X cube”; 


X™ is read “ X to the mt® power "'; and X^ is read “ X to the nth power.” 
y means of exponents may themselves be factors in 
le, the product of two powers of X, 


xx, 
each of which is a product. By definition 


X?= X:X 


x is read “ X to the first power 
simply “ Y square”; X? is read 


Products expressed b 
another product. For examp 


has two factors, X? and X°, 
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and 
X9 — XXX, 
Therefore 
X*. X? = X-X-X-X-X = X5, 
from which we see that we could have added the exponents “ 2" and “3” 
to obtain the exponent “ 5 " of the final product. In general, then, 
Xm. Yn = ymo 
Again, by definition, 
(X39 = X. X? Y? = Xs, 
from which we see that the power of a power can be obtained by multiplying 
exponents. In general, 
(Xn)n = yn, 
Division involving two Powers of the same quantity is performed by 


subtracting the exponent of the divisor from that of the dividend, as in the 
example, 


or, in general, as 


when m >n> 0. 


If we extend this practice of subtracting exponents to situations in which 
m — n, we find that 


A es], 
Given 
xm 
xm 
we divide the numerator and the denominator by X", Thus: 
X" i 
yey =! 
If we subtract the exponent of the divisor from that of the dividend, we 
obtain 
xm 
‘ya = APH us ye 
Therefore, 
T" a 
x» d 


By a similar extension to cases in which n > m > 0, we can show that 
P d 1 
ye = para = Xe- m) 


w 
p? 


Exponents 

where (n — m) is positive and —(n — m) is negative. Given 
xm 
rom 


with n > m > 0, we can divide the numerator and denominator by X", 
obtaining 
1 


x n= m 


where (n — m) is positive. But if we subtract the exponent of the divisor from 
that of the dividend in the original fraction, we obtain 


Xm-n 


which can be written 
XY-QoG-m) 


where (m — n) is negative and equal to —(n — m). 
We can also show that 


zx. 
X 
1 
= x3, 
X? 
and, in general where n > 0, 
1 " 
X^ eX 
Recall that 
X? — I. 
It follows that we can write 
1 AN 
Xr = Xx" x 


Dividing by subtracting exponents yields 
I xo 


= = yo-t = X=%, 


xX" xXx 


Logarithms are special cases of exponents. It is possible to take a given 


Number and express it as a power of 10. Thus 
100 = 10°, 
1,000 = 10°, 
10,000 = 10%, 


0.01 == = 10°, 
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and 


0.001 = — = 10-3. 
Also it can be determined that 
345 = 10253782, 
6,581 = 109.5182» 
and 
0.125 = 10--#0309, 


The exponent in the right member of each of the equality statements given 
above is a logarithm. The quantity 10 is referred to as the base. The number 
which is given expression as a power of 10, that is, the quantity in the left 
member of each equality statement, is called the antilog. 


Examine the relations among the antilogs and the relations among the 
logarithms or exponents. Note that 


100 < 345 — 1,000and 2< 2.53782 < 35 
1,000 < 6,581 < 10,000 and 3 — 3.81829 «4 
0.100 — 0.125 — 1.000 and —1 — —0.90309 — 0. 


Logarithms employing 10 as a base are called common logarithms. Another 
System, the so-called natural logarithms, employs as a base the constant e 


whose value is approximately 2.71828. We shall confine references in the 
following discussi 


If one keeps i 
to see that two 
of 10 and then 
performed by ex 


then subtracting and that a power of a given quantity 


rithm (exponent) of that quantity by 
re usually put in a form in which the 
only with the logarithms (exponents), 
ed. In each operation, after the com- 


ms and the use of logarithmic tables 
1, 1-9), Rosenbach et al, (2, 369-415); 


Roots and Radicals 


Consider the following definition: The nth root of a given number A is the 
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value of each of n equal factors whose product is equal to A. The nth root of 
A is written 
VA. 


Then X is the n'^ root of A or X = VA if 
A = XX X:::X (n factors X). 
vA, 


n is the index, 4 is the radical sign, and A is the radicand. 


In the radical, 


Examples: 
The second or square root of A is the value of each of two equal factors 


whose product is A. Then X is the square root of A if 
X-X =A. 


We also observe that the square root of 4 is A? since 
A2. Ale = A. 


Note that the square root of A is usually written as A with the index 


implied. 
The third or cube root of A is the value of each of three equal factors whose 
product is A. Then X is the cube root of A if 


X-X-X =A. 


And 41/3 is also the cube root of A since 
AB- A18 AB = A. 
in some instances negative, and in 


Roots are in some instances positive, 
ention /9 is assumed to refer to 


Others either positive or negative. By conv 
the positive root, +3; —4/9 refers to the negative root, —3. 
Radicals having the same index and the same radicand can be added or 
Subtracted. For example, 
J2X + 342X = 442X. 
cals with the same index is obtained by taking the 


The product of two radi 
e radical sign. For example, 


Product of the radicands and employing on 
4/5: 4/6 = 4/30. 

e same radicand and different indices 

ions to a form involving fractional 

For example, 


_ The product of two radicals with th 
is obtained by changing the express 
exponents and then adding exponents. 

43:43 = 312-318 = 33/6-32/6 = 3516 — 435. 


It is sometimes convenient to rew 
128 = A24/64. 


rite a radical in factored form as 
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The root of 64 can then be obtained by inspection and the entire quantity 
rewritten as 
842. 


Factorial Notation 


The product of all the positive integers from 1 to 5 inclusive, 
1x2*X3*x4*x 5, 


can be written in a more concise form by means of the following conventional 
notation: 


5! (read “ five factorial ”), 
In general, the product of all the positive integers from 1 to n inclusive, 


DG): + (n — 3)n — 2) — 1)(n), 


is written 


n! (read “ n factorial "). 


The Term in Algebraic Symbolism 


An algebraic expression is made up of one or more terms, a term being the 
product of a number and one or more literal number symbols. The product 
has a prefixed sign. Thus a term is a signed quantity. When the sign is not 
written, it is understood to be positive. 

The numerical factor in a term is called the numerical coefficient. When a 
factor of a term is an unknown or a variable, the other part, which is also à 
factor, is called simply the coefficient of the term. When no number appears 
in the term, the numerical coefficient is understood to be “ 1.” 


An expression involving a single term is called a monomial. An expression 
consisti 


ng of two terms is a binomial. An expression of two or more terms 
1 


is a multinomial. Here are some examples of each. 
Monomials: NX, —nY*, aX, —3 XY 
Binomials: X + Y, x — y, NX? — aXY, 3X3 — 2x2 
Multinomials: X? —2XY + y? 


a + bX + cX? 4 dX? 
XP F3X?Y 4 3xy24 ys 
pi 3p?g + 3pq? — È 
Technically these expressions are also polynomials but no attempt will be 
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made here to distinguish between the two terms, “ multinomial” and 
“ polynomial." 

Terms may be combined when the literal factors are identical. The 
numerical coefficients are combined and written with the literal factor, as 


shown in the examples below: 
3X --2X — 4X = X, 
2X? — 3X? + 5X? = AX?, 
2XY —3XY — 2XY = —3XY. 


Exercise 2.7 
Combine terms wherever possible in each of the following expressions: 


1. 3X* — X? + 4AX* 2. 4XY — 5XY + 2XY 
3. 3aX + 2X — 5aX 4. X + aX — 3XY 
5.3! —XxY- XY+ Y 6. nX — 2nX + 3X 

7. X —3X* — 2X* 8. XF Y2+ 2XY* — bY? 
9, 3X? — Y+ X* 4- 5Y 10. 3X*¥ — 3x Y* 


Answers to Exercise 2.7 


1. 6x? 2. XY 

3, 2: — DaX 4. X + aX — 3XY 
5. X* — 2XY + Y: 6. 3X — nX ' 
7. —4X? 8. 3X2 + Yr — bY? 
9. 4X* + 4Y 10. 3X*Y — 3XY° 


Exercise 2.8 


Combine terms wherever possible in each 
1. 3X + 3X? + 3X? 

3. 4cY + cY — 9cY 

S Xe xy— XY — Y' 

ds 

9; 


of the following expressions: 
2. 2XYZ — 3XY + XYZ 
4. aX 4- cY - AXY 

6. mX + b — 5mX 


+ AY? + AZ? 
2Y* -3Y*--4Y* 8.4X* H 
xe py xiv 4 XYs— X? Y 10. p t pq + Pd + pq? + pq + pq? 
pe +a 
the numerical coefficients, taking 
g rules for exponents to the literal 


+xy2+xy7-¥ 


Terms are multiplied by multiplying 
account of signs, of course, and applyin, 


factors. 


Exercise 2.9 

indica ultiplication. 
ace indicated multip: 2. NONY) 
3. (N X?)(nX) 4. (aX)(aY) 
5. GaX)4aX) 6. GadxY) — 
7. (SaX)GbX) 8. (abX?)GacX Y?) 
9. (—5X3)(4X*) 10. (C4Y)—3Y) 
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Answers to Exercise 2.9 
1. 12a? 

NnX* 

. 12a? X? 

. lSab X* 

—20X5 


ota 0 


Exercise 2.10 


Perform the indicated multiplication. 
1. (—5b)(6b?) 

3. (mX?)(nX?) 

5. (—5aX)(4-5a X) 

7. (6b)(—Sc) 8 
9 


. (2x)(3x)(4x) 10. 


Sea kN 
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N*XY 
axy 
20aXY 


. 3a*bcX* Y? 


I2Y* 


2. (nX)(nY) 

4. 

6. (8c)(cX Y) 

. (—5a)(—5b)(— 5c) 


(—a)(—b) 


(6X)(6.X)(6X°) 


One term is divided by another by dividing the numerical coefficients, 


taking account of the si 
factors. 


Exercise 2.11 


Perform the indicated division. 
1, 10X*/ax 


2. 
3. 3XY/X 4 
5. NX?*/N 6 
7. X? YlnY 8. 
9, FXXIXY 10 
Answers to Exercise 2.11 
1. 10X/a 2 
3.3 4. 
5. xX? 6. 
7. nx? 8. 
9. f 10. 
Exercise 2.12 
Perform the indicated division, 
1. 15a X*/5b 2. 
3. aX Y/bXY 4 
5. NXY/N 6. 
7. fXIfX Y 8. 
9. fX|fx* 10. 


In multiplying a binom 


gns, and applying the rules for exponents to the literal 


. 6Y/3 
. Il0aX?/2aX 
. N*XYINX 


SXI 


. fXYINXY 


. 30XY]20 
- 2bX*/2aX 
. n*[n* 


fXIN 
fXYIfx 


ial (or a multinomial) by a i iplication 
by the single term is distrib ) by a monomial, multip 


ut 
terms of the multinomial 


ed over the two terms of the binomial (or à 
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Exercise 2.13 


Perform the indicated multiplication. 


1. a(X — Y) 2. a(X — a) 
3. a(aX — bY) 4. a(X 4- Y) 

5. a(X + b) 6. «(Y — 2a) 

7. cX(X — Y) 8. bY(X* — Y?) 
9. ac(X* + a) 10. (a — c) XY 
Answers to Exercise 2.13 

l.aX—aY 2.aX—a 

3. à? X — abY 4. aX + aY 

5. aX + ab 6. cY — 2ac 

7. cX? — cXY 8. bX*Y — bY’ 
9. ac X? + a?c 10. aXY — cXY 
Exercise 2.14 

Perform the indicated multiplication. 

1. 3(X + Y) 2. P(a 4- b) 

3. a(x — y) 4. p(n — 1) 

5. (a — B) 6. N(N — 1) 

7. N(X? — X3JN) 8. (o%/n)(n — 1) 
9. plp — 9) 10. p(1 — p) 


f the answers to Exercises 2.13 and 2.14 can be viewed 
oduct into the factors 
given the 


Notice that each o s 
as a problem in factoring which means resolving a pr 


which when multiplied together yield the product. For example, 
expression, 
aX — aY, 
and instructed to factor it, we would examine the expression, note a factor 
common to each term, and write the expression in factored form as 

a(X — Y). 


The student should practice finding the common factor in each of the 


answers to Exercises 2.13 and 2.14. . 
Dividing a binomial (or a multinomial) by a monomial follows the pattern 


of multiplication. Using the monomial as the divisor, the division is distri- 
buted over each term of the dividend. 


Exercise 2.15 


indicated division. 
Perform the indicate 2. GX* + 6X9/3X? 


1. (aX? — aX)/aX 55 
3. (4aX? + 6a Y°)|2a 4. (2X + 3X? + 4X9/3 


Answers to Exercise 2.15 
1.X—1 2,X4-2 
3. 2X3 + 3Y? 4. (2 + 3X + 4X3/3 
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Exercise 2.16 


1. (15a X — 30aY + 45aZ)/15a 2. (X? + X? + X)5X 
3. (XY + XY + XY9))aXY 4. (2X3 + 2X? + 2X)2X? 
The product of two binomial expressions can be obtained by distributing 
the multiplication by one binomial over each term of the other. Thus 
(X + YXX + Y) 
can be rewritten, after distribution, as 
X(X + Y) Y(X + Y) 


which expression now involves two simple problems in the distribution of 
multiplication. The final answer is 


X? + XY + XY + Y2?= X*op2XY + Y. 
By the same operations 
(Y + a} = Y? -2aY 4+ æ 
(X—Yyox—2xyY- ys 
and 
(C-FIXXY-—Y)exixY--xy-yregyrf— ys 


Notice the form of each product and consider the reverse process, that is, 
resolving these answers into the binomial factors which produce them. 


Exercise 2.17 


Perform the indicated multi 


plication. 

1. (x + y 2. (x — 1X8 — 1) 
3. (n — 1)? 4. (X — M,)* 
5. (X — aX + a) 6. (a — d)(b — c) 
7. (Y + ey 8. (3X — 2} 
9. (Y — a(Y + b) 10. N(N — 1)(N — 2) 
Answers to Exercise 2.17 
1. x? + 2xy + y? 2. aß — a— ptt 
i APR 4. X? — 2M,X + Mj 

M Ves a 
i 6. ab — ac — bd + cd 
7. YH 2cY + @ Corsa 
9. I*— aY.LbY— ap 10. N? — 3N? + 2N 


Exercise 2.18 


Perform the indicated mu 


1. ( 20 Itiplication, 

« (& = — 1) 2. — = 

a. (n — 1)(m — 1) 4. Uu - ROSE us 
d ode 6. (1 + rall — ra) 
7. (X + Y} 8. (% + X is 
9. (X, + XQ, + Y) io. (X 4. pr 
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Exercise 2.19 


Y " ; , 
ou are given two sets of paired numbers, X and Y, with four numbers in each 


set. Write: 

1. the sum of X 
2. the sum of Y 
3. the sum of the sum of X and the sum of Y 

4. the sum of the four pair sums 

Compare your answers to (3) and (4). 
5. the sum of the squares of X 
6. the sum of the squares of Y 
7. the square of the sum of X 
8. the square of the sum of Y 

Compare your answers to (5) and (7); to (6) and (8). 
9. the difference between X and Y for any pair 

10. the sum of the four pair differences 

11. the difference between the sum of X an 


Compare the answers to (10) and (11). 
12. the sum of the squares of the four pair differences. 


d the sum of Y 


Answers to Exercise 2.19 


X, X:+ Xs Xs 
Y, + Yat Ys t Y. 
QG 4 Xi Met XO (O8 Yat Ys Yı) 
Qr, + Y) + (Xa + Yo QG  Y9 + Xat YO 
XicoXic X$+ Xi 

Y? d YP4 vit Yi 

0G + XE Xa + XY 

«(Yi + Yat Ys + Yo? 
.X;— Y;orX — Y 

10, (X, — Y.) + (X — Ye) +% 
Tie (X, Xa t X4 X) — Qi Yit Ys 
12, (X, — Y + Oa — Yo? + Q8 — YO 


o0 do tnRO.M-— 


— Y) + Q6 — Y) 
TY) 
Qa — Y 


Exercise 2.20 


_ We are given two sets of 
in one set, Let vı, vs, and Ya 
constant value, a, to each number in the « se 


B, to each number in the v set. The three num 
tou, + a, us + o, and us + % The three numbers in the v set are now v, + B, 


Va + B, and v, + B. Let us refer to the numbers in the new sets as w and v’. 
Write, in terms of the original numbers and the constants, 


the sum of w 


. the sum of v 
the sum of the sum of x and the sum of v’ 


. the sum of the three pair sums 
. the sum of the squares of u 
the sum of the squares of v 


paired numbers. Let i, us, and u, be the three numbers 
be the three numbers in the second set. We shall add a 
t. We shall also add a constant value, 
bers in the u set are now transformed 


QN ta f S 1 — 


42 Arithmetic and Algebra 
7. the square of the sum of w 
8. the square of the sum of v’ f 
9. the difference between u’ and v' for any pair 
10. the sum of the three pair differences , 
11. the difference between the sum of w’ and the sum of v 
12. the sum of the squares of the three pair differences. 


Equations 


An equation is the statement of an equality which does not hold for some 
values which might be assigned to the literal symbol(s) in the statement. 
When an equation involves one literal number symbol, it is said to be a 
equation in one unknown and the number represented by the letter can be 
determined. When an equation involves two different literal number symbols, 
it is said to be an equation in two unknowns and it is possible to determine 
the number represented by one of the letters if a value is given to the m 
Thus an equation in two unknowns expresses a relation between two me 
symbols. When an equation involves three unknowns, the number represente 
by any one letter can be determined if values are assigned to the other two. 


Exercise 2.21 


Write the following as equations. 

- X is equal to 7. 

. The product of 5 and Y is equal to 15. 

X plus 3 is equal to 5. 

- Six times Y is equal to 18 plus 3 times Y. 

Three less than the product of 5 and X is equal to 32. 

- X minus 10 is equal to X". 

The difference, Y minus 10, divided by 5 is equal to Y". 
- The difference, X minus M,, divided by S,, is equal to Zy 
- Y is equal to the Square of X. 


10. Y is equal to 20 plus the product of 5 and X. 


WP 90 nO (fA BO M o— 


Answers to Exercise 2.21 
ca SF 


1 2. 5X = 15 
3.X43-5 4. 6Y — 18 - 3Y 
5. 53 = 35:32 6. X — 10 = x’ 

7. (Y = 10/5 = y: 8. (X — MJS, = Z; 
9. ¥ = x? 


10. Y 2 20+ 5x 
Exercise 2.22 


Write the following as equations. 
1. Two more than 5 times a 


n unknown value, X, is equal to 42. 
2. Five less than the recipro 


cal of the unknown, Y, is equal to 15. 
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3. Twenty-eight times the quotient, N divided by the quantity, N minus 1, is 


equal to 30. 
4. The sum of the four numbers in the X set is equal to 10. 
i 5. The quotient obtained by dividing the sum of five numbers in the Y set by 5 
is equal to 4. 
6. The sum of the squares of the 
7. The sum of the squares of the five numbers i 
8. The sum of the third powers of the four numi 
9, The sum of the fourth powers of the four num 
10. The square of the difference between two unknowns, 


four numbers in the X set is equal to 30. 

n the Y set is equal to 90. 

bers in the X set is equal to 100. 
bers in the X set is equal to 354. 
X and Y, is equal to 25. 


SOLVING SIMPLE EQUATIONS 


In an equation containing a single, literal number symbol, that symbol is 
called an unknown and determining the number(s) represented by the symbol 
is called solving the equation. Each such number is called a solution of the 


equation and is said to satisfy the equation. 

The fundamental principle in solving an equation in one unknown is that 
certain arithmetic operations can be performed on both sides of the equation 
yielding a new equation in which the literal symbol represents the same 
number(s) as it did in the original equation. That is to say, given an equation 


in one unknown, we can 

(1) add the same number or expression to both sides, 

(2) subtract the same number or expression from both sides, 

(3) multiply each side by the same number, or 

(4) divide each side by the same number , 
and the new equation will have the same solution as the original. 

Caution: There are other operations which must be employed with 
discretion. Multiplying both members by an expression containing the 
unknown may yield an equation in which the literal symbol represents 
numbers, one or more of which will not satisfy the original equation. For 
this reason, all solutions should be checked in the original equation. Dividing 
both members by an expression containing the unknown may result in a loss 
of a solution. Raising both sides to a power poses the same problem as does 
multiplication by the unknown. Extracting the same root on both sides 
presents the same problem as dividing by the unknown. 


An equation in which the exponent of the unknown is “ one” is a linear 


equation. To solve a linear equation in one unknown: 
1. Clear the equation of fractions by multiplying both sidès by the least 


common denominator of the fractions. 
2. By addition and subtraction, obtain an equation with all terms contain- 


ing the unknown on one side and all other terms on the other side. 


3. Combine terms containing the unknown. 
4. Divide both sides by the coefficient of the unknown. 


qui 
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Here is an example of the solution of a linear equation in one unknown. 
Given the equation, 


2x —4 
$ —5 


we multiply both sides by 7. The new equation is 
2x — 4 — 28. 
We now add 4 to both sides. The equation becomes 
2x32. 
Finally we divide both sides by 2 and obtain the solution, 
x — 16. 


The solution can be checked by substitution in the original equation. 


Given an equation in two unknowns, one can solve for one unknown in 
terms of the other. Given the linear equation, 


2x + 3y = 7, 


one can solve for y in terms of x. First, subtract 2x from each side, thereby 
obtaining a new equation, 


3y = 7 = 2x. 
Next, divide both sides by 3 and find the solution for y to be 
_1=% 
3 


An equation in two unknowns has an unlimited number of solutions, bY 
which we mean pairs of numbers which will satisfy the equation. 
Exercise 2.23 


Solve each equation for the unknown appearing in it. 


ae ees 2. 6¥ = 18 

+ 3Y — 

2 3 = 12 4, T =5 

5. XY* 43 — 19 6. 2X? — 8 = 42 
7. (X — 3)? = 25 8. X* — 6X +9 = 25 
9./YcT 6-3 10. VY +6 —/2Y 
Answers to Exercise 2.23 

1. = iS 2 Y=3 

3, 321 4. Y — 16 

S. X edd 6. X = 45 

7. X =8 or 2 8. X =8 or —2 
oque 10. Y 6 


Equations 
Exercise 2.24 


45 


Solve each equation for the unknown appearing in it. 


1, 5X — 15 
3. 6Y — 3Y +18 


5.5X42—42 


28 N 
de Nes 


30 


Exercise 2.25 

In each problem solve for th 
1l. X4-225Y 

Y= 

2X+7Y = 18 

X= 

. 3(X — Y) = 24 


3; 


Answers to Exercise 2.25 


1. Y 2 (X + 2/5 
3. X = (18 — 7Y)/2 
5.X—-Y-^8 

7. X — (Y — alb 
9. X=x+ Mi 


Exercise 2.26 
In each problem solve for the ind 


i X+a= BY 
Y= 

3. 3a + 5S8 = 9 

5.a(XY — Y)- 5* 

7. Y' = (Y — ayb 
Y= 

p= 


2 
4 


6 


8 


10. 


2 


.X-325 
. 5X — 3 = 32 


1 
T peas 


. 1 — r&y = 0.36 


. (X + 2)? = 25X" 


e indicated unknown in terms of the other unknowns. 


m+1=9%n+ 1) 


a= 


4.p+q=1 
p= 

6. (x — y)? = 36 
x= 

8. Y' = (X — 50)/10 
X= 

10. Zy = = M;)/Sz 
Mz = 

2. n=(m+ 0/9— 1 

4.p=1-4 

6 x=y+6 

8. X = 10X' + 50 

10. Mz = X — ZS; 


icated unknown in terms of the other unkn 
2. 


4. 


10. 


owns. 


a—l12mB-—l 
p= 

3M, = X t+ Xs Xs 
Mz: = 
= = (1+ oF 
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SYSTEMS OF EQUATIONS 


Suppose that we have two linear equations, each involving the "o pe 
unknowns. It is possible that a pair of numbers which satisfies Pie 
equation will also satisfy the other. It is also possible that only one pa Ar 
numbers will satisfy both equations. When two linear equations are satis - 
by one and only one pair of numbers, they constitute a system of. pete E 
linear equations and we can solve for the numbers represented by the letters. 

Here is an example of a system of two simultaneous equations: 


x+y =6 
5x — y = 18. 
One way to solve this system of equations is to perform an arithmetic 


. H H H e 
operation which will yield a new equation in only one unknown. Examin 


the two equations given above. If we add the left numbers and then add the 
right members, we obtain 


(Sx — y) + (x + y) = 24, 
or 


6x — 24. 

We can then divide by 6, Obtaining the solution: 

x = 4, 
We can now find the value represented by y, by substituting 4 for x in either 
of the original equations and solving for y. Substituting in the first equation 
and solving for y gives 

4+y=6 
and 
¥ = 


We can check these values, x = 4 and Y = 2, by substituting them in the 
original equations to see if e 


ach equation is satisfied. Substituting in the 
first equation gives 
44+2=6 
Substituting in the second equation yields 
5(4) — 2 = 18. 


Note the varied operations which can be employed to eliminate one of the 
unknowns, 
Ll. &, De y=5 
b. x ^y 


Subtract (b) from (a) to eliminate y in (c). 
€. x—2 
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Substitute for x in (a) or (b) and solve for y. 
y=1 
Check by substituting the two solutions in (a) and (b). 


2. a. 5x — 3y = 21 
b x+y =1 
Multiply (b) by three to obtain (c). 
c. 3x + 3y =3 
Add (a) and (c) to eliminate y in (d). 
d. 8x = 24 
Solve for x. 
x = 3 
Substitute for x in (a) or (b) and solve for y. 
y=? 


Check by substituting the two solutions in (a) and (b). 
3. a. 7x + 2y = 38 

b. 4x + 3y = 31 

Multiply (a) by three to ob! 

c. 21x + 6y = 114 

d. 8x + 6y = 62 

Subtract (d) from (c) to eliminate y in (e). 

e. 13x = 52 


Solve for x. 


tain (c). Multiply (b) by two to obtain (d). 


—4 


Substitute for x in either (a) and (b) and solve for y. 
y-5 


Check by substituting the two solutions in (a) and (b). 
Systems of simultaneous equations in more than two unknowns can be 
solved by similar procedures. With three equations in three unknowns, 
operations are chosen which will yield two equations in two unknowns; then 
further operations are performed to produce one equation in one unknown. 
After solving for this single unknown, we £o back, substitute in an equation 
with two unknowns and solve for a second. Then we go back to one of the 
original equations in three unknowns to obtain the value of the third. 
y substitution in all three of the original equations. 


Checking is accomplished b c 
Here is a demonstration of the solving of a set of three simultaneous linear 


equations in three unknowns. 
a. 3x —2y +2 = 2 
b x— 2y + 3z = 6 
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c. 2x — y z= =3 . . 
Subtract (b) from (a) to eliminate y in (d). 
d. 2x — 2z = —4 
Multiply (c) by two to obtain (e). 

& 4x —2y —22 = —6 
Subtract (e) from (b) to eliminate y in (f). 
f. —3x + 5z = 12 
Multiply (d) by five to obtain (g). 
g. 10x 10z = —20 
Multiply (f) by two to obtain (h). 
h. —6x + 10z = 24 
Add (g), and (h), to eliminate z in (i). 
i 4x 
Solve for x. 

x=1 
Substitute for x in (d) and solve for z. 


253 
Substitute for x and z in (a) and solve for y. 
y=2 
Check by substituting all three values in (a), (b), and (c). 
Exercise 2.27 


Solve each set of equations for the unknowns appearing in them, 
1. bi(5) + b,(3) = 25 


2. a(20) + a4(16) + a,(30) = 208 
b,G) + 6,2) = 16 a,(16) + a,(10) + a,(10) = 102 
(30) + a.(10) + a,(8) = 122 


Answers to Exercise 2.27 
1, by = 2:5, 5 2. a, = 2;a, =3;a, = 4 
Exercise 2.28 


Solve each set of equations for the unknowns appearing in them. 
1, e(106) + 0404) = 520 2. a(10) + b(12) + e(15) = 116 
e,00) + e4) — 160 4(12) + b(8) + e(20) = 128 
a(15) + B(20) + (16) = 154 


Functions and Graphs 


A function is a Specification of the 
mbers. The relation is usually exp 


relation between two or more sets Of 
one set on the other(s), When a lett 


ressed in terms of the dependence of 


er symbol represents any number in ^ 
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set of numbers, the letter symbol is said to be a variable. There are two 
principal ways of incorporating variables in the writing of functions. In the 
one way, the number represented by an algebraic expression is shown to 
depend on the number or numbers represented by one or more variables. 
The number represented by the expression is called the value of the function. 
The variable on which the value of the function depends is called the 
argument. Two examples of this first way of writing functions are given below. 
|, The expression, 
x + 2) 


is a “function of Y": that is, the number represented by the expression 
depends on the number represented by X. The number represented by 
(3X + 2) is the value of the function; X is the argument. 
2. The expression, 
(X4 + Xo + X3) 

The number represented by (X, + Xs + X3). 
ree arguments: Xi, X» and X;. 
nction is to employ an equality 
the number represented by one 


esented by one or more other 
is the value of the function and 


is a function of Y,, Xa and Xs. 
is the value of the function. There are th 

The other principal way of writing a fu 
statement and specify the relation between 
variable and the number or numbers repr 
variables, If y is said to depend on x, then y 
x is the argument. Two examples are given below. 

1. The variable y is expressed as a function of the variable x by the equality 


statement: 


y = 3x? + 2x S 


Here y is the value of the function and x is the argument. 
2. The variable M, is expressed as a function of the variables Xi, X», Xs. 


and X, by the equality statement: 
xX, + Xs + Xs d X, 
M, = 5 i 
M, is the value of the function; X,, Xs, Xa and X, are the arguments. 
Recall that a literal number symbol sometimes refers to a particular 
number of unspecified value rather than to any number in à set, Such literal 
symbols are said to be constants. Numbers themselves are, of course, 


constants. i , " 
A function involving only one argument lends itself quite readily to 
. ai : 5 hat we have a function involving 
graphical representation, Let us suppose t ; 
x and y, where x is the argument and y is the value of the function, For a 
graphical representation. we employ two intersecting straight lines, a hori- 
zontal line for the scale of x and a vertical line for the scale of y. The 
horizontal line is called the x-axis. The vertical line is called the y-axis. The 


3 Arithmetic and Algebra 


lines form a right angle at their intersection. The point of intersection is 
called the origin. The two lines are called the coordinate axes. i 
Each scale is marked off in equal intervals and numbered from zero si 
origin. The x-axis is numbered from the origin to the right with e x 
numbers and to the left with negative numbers. The y-axis is numbered fro 


ier i ive 
the origin upward with positive numbers and downward with negati 
numbers. 


Given an equality statement showing the relation between yandx,a value 
of y can be computed for any value assigned to x. These two values lodi ni 
a pair which can be represented as a point located in the I 
space in which the coordinate axes are located. The two values are calle 
the coordinates of that point. The point is located by the intersection of two 
lines, one drawn perpendicular to each axis at a point on the scale correspond- 
ing to the value of that coordinate. The coordinate which locates the Ene 
drawn perpendicular to the horizontal axis is called the abscissa, The co 


ordinate which locates the line drawn 


perpendicular to the vertical axis 1S 
called the ordinate. 


The coordinates of a point are written (x, y) with the 
abscissa first and the ordinate second. 

For a given function we 
continuous line which woul 
would satisfy the equation, 


The linear function 


might locate a number of points and iain 
d indicate the locus of points whose coordinate: 


Y —0.5Y +1 


yields a straight line when it is plotted. This line intersects the vertical aXI 


case we say that the Y intercept is one. The E 
) on the horizontal axis, is —2, The comen 
of the line, the distance the line moves vertically 


Y 
E —0.5 
p 1 
a 2 
. The following quadratic equation is an example of a function whose plot 
Is a curved line: 
Y= 2x? 4X 4 4, 


E 
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-E—4— ie t 
-€ 1 414243 +4 t5 + 

T-! 

-2 

-3 

-4 

-5 
1-6 


Fig. 2,1. Plot of the linear function, Y = 0.5X + 1. 


1 
6 


x 
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Listed below are paired values of X and Y. These paired values have been 


plotted in Fig, 2,2, A smooth curve has been drawn through the points, 


Y y 
4 52 
3 34 
—2 20 
=Í 10 

0 4 

1 2. 

2 4 

3 10 

4 20 

5 34 

6 52 

Y 
+64 
+32 
Eii AA [ri 44 48 Fo 
-32 
—64 


Fig.2.2 Plot of the quadratic function, 


Y —2X? — AX +4. 
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i this 

The Y intercept of the function in Fig. 2.2 is 4. Note ie iue om 

curve varies, depending on X. The slope can be shown grap die a poit 

a line tangent to the curve at any point. A tangent has e siege df the 
P (4, 20) in Fig. 2.3. The slope of the tangent at point P is 


i have a 
curve at that point. Note that a tangent at any other point would 
different slope. 


T 
+64 


P (4, 20) 


+344 4546 X 


oint 
Fig.23 A straight line has been drawn tangent to the curve of Fig. 2.2 at P' 
P (4,20). The function is Y 


t the 
= 2X* — AX + 4. The slope of the tangent, 4X — 4, à 
point P is equal to 12, 


, ; by 
The slope can be expressed as a function of X. It is determined 
differentiating the expression 2X2 


tive—that is, the slope—is equal to 
when X < 1 and positive when 
when X = 1. In this particular e 
occurs at the point where the slo 
on the curve is located where 


: E s n 
involves the determination of the number represented by a literal symbol 


value of the expression in which a 
an generally be obtained by 2 = 
ng the derivative, setting the reel 
quantity represented by the sym 


Pe. 
yo SER 


, WM e ion D 
obtained by multiplying and dividing the right-hand side of the equation A 
the coefficient of X2. The numerator 


é : ws: 
on the right can be rewritten as follo 
"Differentiation is a maj 


n esent 
or topic of calculus and is beyond the scope of the PT 
review. 
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(4X? — 8X +4) +4 
de a) 


or --— 
Qx — H4 
y= = 
Since (2X — 2)? cannot be negative and must be equal to or greater than 
Zero, we observe that Y will have its minimum value when 


Qx —2y = 0, 
in which case 
(2X — 2) =0, 
2X = 2, 
and 
yal 


The minimum value of Y can be determined by substituting this value of X 


i igi i 
n the original equation, y 2X? —4X + 4. 


Substituting, we find that -—— + à 
or 
Y 2. 


Functions of Angles 
e axes and a line which coincides with the 


i i f a clock, is free to 
d which, like the hand of 7 
cat In Fig. 2.4, the line OQ is shown 


Imagine a set of coordinat 
Positive segment of the X axis à 
rotate clockwise or countercloc 


¥ 


+6 


41 +2 43 44 +5 +6 
-1 


-6 


Fig. 2.4 The angle @ generated by the counterclockwise rotation of the line OQ. 
«2. e angle 
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its initi iti i sition 
otated counterclockwise from its initial position to the terminal po: 
T: 
indicated. 


In trigonometry, an angle is the amount of rotation i m ks e ha 
line from its initial position to a terminal position. The coa 
can be measured in terms of degrees, minutes, and seconds. — 

If we let O be the point of origin, P be any point ona rotate x d 
be a perpendicular erected on the horizontal axis at point M an n den 
through P, we can identify a triangle, OPM. An example ofa pias ee 
mined in this fashion is shown in Fig. 2.5. We can also identify t a sf de 
corresponding to the three sides of the triangle. OP is the B h iets 
hypotenuse of the right triangle; OM is the distance of M from the 


i i he 
one of the coordinates of the point P; MP is the distance of P from tl 
horizontal axis, the other coordinate of P. 


M 
F———tF—L—L— ib Xx 
76 -5 -4 -3 -2 -1 O| 41 42 43 44 45 có 
T-1 


T 


1-3 


T-4 


r been int 
Fig 2.5 Determination of the right triangle OPM. P is an arbitrarily chosen pO 
online OQ. The line from M to P is perpendicular to the X axis. 


For convenience, let us use r for OP, X for OM, and Y for MP, and 0 - 
the amount of rotation. Functions of the angle 0 are defined by certain Won 
involving r, Y and Y. We shall consider here only three of these ratios: t 
sine, the cosine, and the tangent of the angle 6, 

Sin 8 is read * the sine of the angle theta ” 


defined as the rat 


i » is 
or simply “ sine theta " and 
lo of Y to r. That is, 


sin 0 es. 


T 
he cosine of the angle theta” o; 
o of X tor. That is, 


Cos @ is read “ t 


> is 
r “ cosine theta ” and 
defined as the rati 


cos ĝ zx 
r 
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Tan @ is read “ the tangent of the angle theta " or “ tangent theta ” and is 
the ratio of Y to X. Thus 


Y 
tan 0 =X 


Values of sin 8, cos 0, and tan 9, as well as the other trigonometric functions 
not given above, depend on the amount of rotation of the line and not on 
the distance OP = r which locates the point P on the line OQ. In other words, 
for any given amount of rotation, the sine, cosine, and tangent are fixed 


values, 

The student should try to integrate these i 
angles with his ideas about functions in general. A trigonometric function 
involves two variables: the value of the function (a ratio) and the argument 
(the angular amount of rotation). Values of these functions are widely 
available in tabular form. See, for example, Hodgman (1, pp. 87-110) and 


Walker (3, pp. 330-331). 


deas about the functions of 
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THE SIGN OF SUMMATION 


Syntactics | ő 


sigma,” formed %, is 
ent a number, It is an 


» the symbolism would be excessively 
peration of summing can be indicated ee 
a single letter, statistical symbolism can 
with it is increased immeasurably. X 
Ssion of the sign of summation, we shall let ii 
set, there being N numbers in that set. We sha 


ing N numbers. We require that Y,, Y}, and Zi 


À n 
» be classified together, and no 
Such triplets, the members of each triplet bea 
identical subscripts. 


Tre 
e 3.1. Columns of the table correspond to the letters Y, Y, and 


Table 3.1 
A CLASSIFICATION OF NUMBERS 
Columns 
Xx Y 
X, n Z 
2 X, Y. Zi; 
3 X, I Z, 


Rows 


N Xy Ys 2y 
56 
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Rows of the table correspond to the triplets and can be identified by the 
common subscript of the three letters representing the numbers in a particular 
row. The table is rectangular and possesses as many columns as there are 
sets of numbers and as many rows as there are numbers in each set. 
In the discussion that follows, the letters a, b, and c are constants, that is, 
particular numbers of unspecified value. 
Here are the definitions of some fundamental expressions in which “ 2” 
occurs, 
1, XX =X, + X, + Xs due + XN 
ZYX is read “ sum of X." 
The operation is equivalent to summing the numbers in the first column 
of Table 3.1. 
2. DY = Y, + Ya + Ys + + YN 
ZY is read “ sum of Y." 
The operation is the summing of numbers in the secon 
Table 3.1. 
3. ZZ =Z, +Z + Zs +.. + ZN 
2Z is read “ sum of Z.” 
The summation involves the numbers in t 
Notice that each of the summations given above has N terms. 
4. (EX = (X, + Xa + Xs +o XY 
(2X)? is read “ the square of the sum of ax 
The operation involves adding the N numbers in the first column of 
Table 3.1 and squaring the resulting sum. Similar expressions could be 
Written for the square of the sum of Y and the square of the sum of Z. 
5. Zxs— X24 XP XP ee + X 
2X? is read “ the sum of squares of X 
The operation requires computation of the square of each number in the 


first column of Table 3.1, followed by the summation of these squares. AY? 
and SZ? are defined in a similar fashion. Notice that 
EX? x (ZXP. 


d column of 


he third column of Table 3.1. 


The sum of squares of X is not equal to the square of the sum of X, since 


Ext yg X24 XP. c XM * 
EP Aa TATE re de Xyk Regt ane XP m GERD 
t terms like 2X, X5; these product terms 


The quantity, (2X)? involves produc ese p 
A numerical example is given below. 


do not appear in the quantity, Ex 
poscezQ-3i4 
29 Æ 8l. 
6. XX 4 Y)- Qn + Yo + GG + Ya) +... + (OX + Yn) 
XXX + Y) is read “ the sum of the pair sums of X and Y." 
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This operation involves two summations. Referred to Table 3.1., it pic 
adding the two values in the first and second columns of each row. Then 
these row or pair sums are added. Notice the pairing of X and Y. 


7. EaX = aX, +aX,+aXy+... tay . 
2aX is read “ the sum of products of the constant a and the variable de 
To perform this operation on Table 3.1, one would multiply each value in 
the first column by a and then sum the resulting products. 
8. Za —a-taca4 ++. c à (N terms a) 


Za is read “sum of a” and implies the repeated addition of the 


constant. Similar expressions could be written for b and c, and for other 
constants. 


9. EXYY = XY, 


TOY, SY +... + ky Vy 
ZXY is read ‘ 


“the sum of products of variables Y and Y." 


products. Notice the pairing of Y and y. 


Exercise 3.1 


Write, in Symbols, the defin 


ition of each of the expressions listed below. X and Y 
are variables; a, b, and c are 


constants, 
1. Sx 2. AX + Y) 
3. Sax 4. Zc 
5. ZXY 6. zy: 
7. XbcY 8. Zh? 
9. (X + yy 10. Xx] y) 
11. (ZY)? 12, Zax: 
Answers to Exercise 3,1 
y Xi P X. + Ny + toe t+ Xy 
“(M+ Y) + G4 E + (Xn + Yy) 
. aX, + aX, + aX, + ss) +aXy 
eee. ls (N terms c) 


"Ht E XY. 


M: 0 MO ta 0 NS — 


2 oec UN + Yy} 
10. XY, + xy, 4. VdY. + Xy Yy 
lt. (5 4- Y, 4. T A 
12. aXi + aX? 4 ay: 


Exercise 3,2 


Write, in Symbols, the definiti 
b, and c are constants; 
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|l. zz 2. XX4 Y 2 

3. aXX 4. Xxyz 

5. Ex* 6. ZabX 

7. EX + ZY 8. Ze 

9. XX — YF 10. Z(YJZ) 
11. (ZZ) 12. Sa = Y) 
Exercise 3.3 
Translate the following verbal expressions into algebraic expressions involving 

"ye. 

1. the sum of X 

2. the sum of the pair sums of X and Y 

3. the sum of products of X and Y 

4. the sum of squares of Y 

5. the square of the sum of X 

6. the product of the sum of X and the sum of Y 

7. the square of the sum of products of X and Y 

8. the sum of products of a constant, a, anda variable, X 

9. the sum of products of a constant, a, and two variables, X and Y 
10. the sum of N numbers, each of which has the constant value, a 
11. the sum of the ratios of Y to X 
12. the ratio of the sum of Y to the sum of X. 


Answers to Exercise 3.3 


]. XX 

3. ZXY 

5. (2X)? 

7. (ZXY) 
9. axy 
11. 2(Y/X) 


Exercise 3.4 


Z(X + Y) 
ZY? 
ZXzY 
Zax 

Za 
ZY|zX 


[RI 


Translate the following verbal expressions into algebraic expressions involving 


“pn 
Lv. 


. the sum of the N numbers in the Y 
. the sum of the Y-and-Z pair sums 


- the sum of the cubes of X 

. the square root of the sum of X 
. the product of the sum of X a 
- the square root of the sum o 
the square of the sum of pro 
- the sum of products of the constan 


WHONADAUVAWN— 


set 


- the sum of the N products of X and Z 


nd the sum of Z 
f products of X and Y 


ducts of a and X 
t a and the variable, the reciprocal of X 


10. the repeated addition of the constant a 


11. the sum of products of the reciproc 
12. the sum of the ratios of the constan 


als of X and Y 


t a to the variable X. 
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Simplification Rules 


i i ich are 

Certain expressions involving “ X" can be rewritten in forms o wc 
more convenient for computation and which may be necessary enm 
reasons to be discussed later. The rewriting of these E ria ia 
simplification. There are four rules which can be applied singly E v le dim 
combinations to effect these simplifications. The four rules are liste 
plained below. 

RULEI. SY + Y) = ZX + Xy. 


i [X 
The sum of the X-and- Y pair sums is equal to the sum of the sum o 
and the sum of Y. 


Proof: By definition, ee 
A(X + Y)=(X, + Y) + (X, + Yo) + (X4 + Ya) F ... TON + Yn). 
s 

The terms in the right member of the equality statement can be regrouped a 


(X + X, d- S uuu FAD CY, Se Ya YS ow Fo YH), 
which expression can then be written as 


ZX + Ey. 


Caution: Rewriting Xy + Y)as DX 4. Sy looks superficially like = 
distribution of multiplication by “ X”, but this interpretation is not erro 
As the symbol “ X» is employed in this book, it does not represent a number, 
and, taken alone, is never a factor in multiplication, 

RULE Il. ZaX = a Zy, 


The sum of the products of a Constant and a y 
nd the 


ariable is equal to the product 
sum of the variable, 


of the constant a 
Proof: By definition, 
Zax 

We observe that a i 
the equality. We ca 


= aX, +aX,+aX,4.., + aXy. 


; A f 
$ a factor common to each term in the right member o 
n remove this common factor a 


a(X, *X X, — 
Which can then be Tewritten as 


az. 


Simplification Rules el 


RULE n. Xa = Na. 

The repeated addition of a constant, when there are N terms in the 
summation, is equal to the product of N and the constant. 

Proof: By definition, 

Sa=at+at+at+...+a (N terms a), 
but recall that, in arithmetic, the multiplication, N x a, is defined in the same 
way. That is, N x a is defined as the repeated addition of a, there being N 
terms in the addition. It follows that 
Xa = Na. 


RULE IV. ZXY cannot be rewritten in any form which simplifies com- 

putation. By definition, 
ZXY = X,Y, + XsYs + XsYs b s + XnYn 

and, with respect to the multiplication and the addition, there is only one 
Correct order of operations, multiplication before addition. 

Caution: The sum of products of two variables is not equal to the product 
of the sums of the two variables. That is, 

EXY # ZX£XY 
or 
XY, + XaYa +... + XwYn # 
(X + X:+.. + Xn Yı + Ye +- + YN). 

um of products for two variables does not 


A numerical example in which the s i 
he two variables is given below. 


equal the product of the sums of t 
DXZEAXSHEOXTAHA+4+ HB 4547 
68 Æ 180 


Exercise 3.5 


Rewrite the following expression 
above. In addition to a, b, and c, N and numeral 


are to be considered variables. 


s in simplified form by applying the rules given 
s are constants. All other letters 


1, Z(x — y) dE 
Pl 4. 3(X/Y) 
s XY + a) : ory 
- Zac 
9. X(aX — c) " =f ¥ 
ll. ZpiN Ed 
2s. Z(bX — cY) ^ E) Y) 
- Z(a — b) ; "nad 
17. X(Xa) es P » 
=g 


19. 2X/ 2Y 
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TEF 
21. Z(a[Y) mm 
24. X(X — a/N) 
X(a]N. 2 

e pais] 26. Z4aY 

27, E(X = 2) 28. zy: 

29. ZXZX 30. Z(a — x) 

Answers to Exercise 3.5 

1. Zx — Ly 2. bZY 

3. N? 4. 2(X/Y) 

5. ZY + Na 6. ab SX 

7. Nac 8. yay 

9. aZX — Nc 10. cLXY 

X+yY* 

11. po 12. >* 5 

13. bEX — c £Y 14. a ZX? . 
15. N(a — p) 16. ZY*[ -2zyy + xy? 
17. (l/a) EX i8. Naje : 
19. ZX|zy 20. ZY* — 2e Zy +. Ne 
21. a X(1/ Y) 22. (lla EX + xy) 
23. a 24, EX —a 

25. 3Zy:y 26. da Sy 

27. ZX — 2N 28. zy 

29. ZXEX or (2X)? 30. Na — zx 


2221 
Z Z 
computation. 


Exercise 3.6 


is also correct but would not be considered 


— f 
a simplification O 


Rewrite the following expressions in 
simplification rules. In addition to a, b, 
constants. All other letters 


: " " ur 
Simplified form py applying the ee 
c, N, and numerals, Greek letters 


are variables, 


L XX+ y 4 2) 2. Zax 

3. Zabc 4. EXYz 

5. Z(a + by) 6. Zay: 

7. EN? 8. ZYzyzz 
9. Xax — by) 10. Zay:y 
11. Ze — 

12, > Xil 

13. Z(ax + 3) 14. Zay? 
15. Z(a + B) 16. BCX + a)? 
17. 2(¥/c) 18. =(ab/c) 
19, (2X 2Y)/ oz 20), (yr =: ô)? 
21. Z(ajy) 22. DEFFLIE 
23. Z(ajg) i 

i B 24, Z(XIN) 
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25, 25 26. SSN 
27. X(X* — 10) 28. ZX*Y: 
29, (ZX)? 30. Z(a — X/a) 


The Summation Sign in Equations 


We have been employing literal number symbols as constants and variables, 
and we have been writing the Greek capital letter “ Z” to indicate the 
Operation of summing. To indicate a summation, " Z” will always be 
Written with one or more other symbols; it will not be written alone. When 
“ X" is written with another letter, number, or combination of letters and 
numbers, the entire expression represents a number or a quantity which can 
be used in further arithmetic operations or computation. In solving a single 
equation or a system of simultaneous equations, a factor or a term involving 
the summation sign is treated as a literal symbol rep 
Occasion, the solution of an equation for some specifie 
application of the rules of simplification. 

Given below are some examples of operations performed on equations 
containing one or more summation signs. 

1. The problem: Given M: = SY/N, solve for 2X. 

The solution: Multiply both sides of the equation by N and obtain 


SX = NM. 


N), solve for 2x°. 
the equation to obtain 


resenting a number. On 
d unknown may require 


2. The problem: Given Sy = (27) 
The solution: Square both sides of 
S2 = ZXxWN. 


Multiply both sides by N to obtain 
3x? = NS}. 


3. The problem: Given 
PoyNSxSy = ZX}, 
Solve for Tas 


The solution: Divide both sides of the equation by NS,S, and obtain 


EXW 

lsu — NS,S, 

4. The problem: Given —Á 
^ 2x Li 

Se = UN d 


Solve for the quantity, N — 1. 
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The solution: Square both sides of the equation to obtain 
2 Ax . 
|N—] 
Multiply both sides by (N — 1) and then divide both sides by S; to obtain 
the answer, 
zx 
= s? 


N-1 


5. The problem: Given 
A(X — M.) = 0, 
where M, is a constant, solve for Mz. 


The solution: Ap 


ply the first and third simplification rules and obtain 
the equation, 


ZX — NM, = 0. 
Add NM, to both sides and then divide both sides by N to obtain the solution, 


6. The problem: Given 


Where a and b are constants, solve for DY, 


The solution: Apply the first, second, an 


d third simplification rules to 
the expression on the right side to obtain 


2 = ; (2X — Na). 


Multiply both sides by b and then add Na to both sides to obtain the solution, 
2X = bEX' + Na, 
7. The problem: Given the simultaneous equations: 


(1) ax? + bXxy = 2xz, 
(2) aXxy + bay? = Xy; 
solve for a and b, 
The solution: Divide (1) by Zx? to obtain 
(3) 


a+ b&xy/ y: — Zxz| Xx, 
Divide (2) by Zxy to obtain 


(4) a+ b2y/ Ixy = Zyz/ Ixy, 
Subtract (4) from (3) to obtain 
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(5) bXxy|Ex? — bLy*/Zxy = Zxz|Ex* — Xyz| Exy. 
Remove the common factor, b, on the left side in (5) and write 
b(Xxy|Ex? — Ly?/Xxy) = Zxz|Ex? — Xyz| Exy. 
Divide both sides by the coefficient of b to obtain the solution, 
Pz Zxz| Ex? — Zyz|Zxy, 
Axy|Ex* — Ly?/Xxy 


Note that the final solution for b can be written in a different form. If we 
multiply the numerator and the denominator (above and below the bar) by 


Zx? xy, we obtain 


_ ExyXxz — AxtXys 
ge (Exp — Extxày? 


By a similar procedure, which will not be presented here, we could solve 
for a. We would find that 
Ayz| Zy? — AXxz| ZXY 


2 


a= xy Sy? — Zx%/ Sxy 


which can be rewritten as 
Ixy Lyz — Ayxxz: 
a= (2xy)? = dx? Ly? 
b, by the formulas derived above, if 


We could compute the values of a and ! 
ted by the various sums of squares 


we were given the actual numbers represen 
and products: Xx?, Sy, Zxy, xz, and Zyz. 


Limits and Indices 


As the summational notation was described above, the number of terms in 
the summation was understood to be N and the summation was understood 
to involve all terms from the first to the Nt, but other possibilities must be 
taken into account. There is à conventional way of making explicit the 
number of values in the entire set to which the summation applies, and the 
last term in the series of terms. A number is written above the summation 
Sign. The number is called the upper limit of the summation. Several examples 


are listed below. 


N . n 

In the expression, X X, the limi 
and that the summation ends with the N 
Summation begins with the first term and includes all N terms. 


it, n, indicates there are n values of Z and 


t, N, indicates that there are N values of X 
Nth value. It is understood that the 


n : 
In the expression, 2 Z, the lim 
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the summation ends with the nth value. It is implied that the summation 
begins with the first term and involves all n values of Z. 


d Pi 
In Z Y, the limit, c, indicates there are c values of Y and the summation 


ends with the c^ value. It is understood that the summation begins with the 
first term and includes all c values. 


8 

In & X*, the limit indicates there are eight values of X and, consequently 
eight values of X? It also indicates that the summation ends with the eighth 
value. It is implied that the summation begins with the first term and includes 
all eight terms. 

In each of the expressions given above, the summation begins with the 
first term but it is Possible to begin with some term other than the first. There 
is a conventional way of writing a summation to make explicit where in the 
series of terms it begins. A number is written below the summation sign. It 
is referred to as the lower limit. The lower limit indicates the number of the 
term with which the summation begins and the summation is then understood 
to include all succeeding terms including the last as indicated by the upper 


limit. Examples of expressions in which both limits are made explicit, are 
given below, with their definitions. 


N 
OK = My he hy 34.24 X. 


2 tic XQ. + Xm 
eo In our work the summation will always begin with the first term, 
es Will be no need to write the lower limit, On Occasion, it will be necessary» 
ever. 


l i IS à variable, a is a constant, the lower limit 
is understood to be l, 


and the number of v. ; ; The 
Li » alues N. 
Symbol, i, is called the index of summation of thie aeration 


In the expressi ) i 
on, Jay, x. is Š " N . " 
g ^ A; Is the variable in the summation, a is a com 


RR 
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stant, a lower limit of 1 is implied, the number of values of the variable is 
n, and j is the index of summation. 


m 
In the expression, Z a; Y;, Y; is the variable in the summation, a; may vary 
i-1 


but in this summation is a constant, the lower limit is 1 and is made explicit, 
the number of values of the variable is m, and i is the index of summation. 


In the expression, Sax; Y,, there are two sets of paired values, X; and Y;. 


The index of snam is i. There is a constant, a. There are n terms in the 
summation. 

When a set of numbers is classified into subsets, the succession of terms 
in a summation can be formed in more than one way and it becomes neces- 
sary to specify the particular group or series of terms intended by the sum- 
mation. This specification is also made possible by the adoption of an index 
of summation for each way of summing. In a later chapter dealing with the 
analysis of variability for classifications of numbers, limits and indices will be 
discussed more fully and it will be shown how they considerably extend the 


range of application of summational notation. 


When there is no need for limits and indices, we shall omit them. 


MOMENTS 
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Moments constitute a class of values which may be computed for any set 
of numbers by the application of a general formula. To compute a moment 
for a set of N numbers, one raises each number in the set to a specified power, 
sums these powers, and divides this sum by N. Particular moments are 
identified and named by reference to the power to which the numbers are 
raised, that is, by reference to the exponent employed. Thus the mth moment 
in the class of moments is defined as follows: 


The m*" moment of a set of N numbers is the sum of the mt” powers of the 
N numbers, divided by N. 


Moments of X 


In summational notation, the mt" moment of X is written 


Zym 
N 


, 


the first moment of y is written 
the second moment is 
the third moment is 


and the fourth moment is 
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We could, of course, continue indefinitely in the writing of higher moments. 


Notice that the zeroth moment is 


which can be written 


THE MEAN 
The first moment of Y is called the mean of X. We shall employ M; as the 
Symbol representing the mean. (X is employed by many: authors for this 
purpose.) Thus z 
X 
M, is computed as indicated by formula (4.1), that is, by summing the N 


numbers in a set and dividing the sum by N. 
Example: Given the numbers: 4, 4, 6, 6, 6, and 10, we let X represent any 


value in the set and N, the number of values in the set. Then 


EX = 36 
and 
2X 36 6 
Mh, ae ig c 


Moments of x 


We propose transforming X to x by the formula: 


x= X — Me 
refers to any number in the original set; M; is the mean or 
x is any number in the new set which 
deviation from the mean. 


In the formula, Y 
first moment of the original values; 
results from the transformation and is called a 

Consider now the first four moments of x. In summational notation, the 


first moment of x is written 
2x 
—d 


the second moment of x is 
P 
Y 


N 
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the third moment is 


ax 
N 
and the fourth moment is 
ax 
N 
Notice that the zeroth moment of x is 
Zx? 
N 
which can be written 
AD. N 
N N 
Notice also that the first moment of x, 
Zx 
N 
can be written 
X(x — M;) 
N 


an expression which can be simplified, as follows: 
ZX — ZM, _ EX — NM, E 
N N Nw 7 Ma — Ms 


THE VARIANCE 


The second moment of x is called the variance. We shall employ V, as the 
symbol for the variance. AV. is called the standard deviation and will be 
represented symbolically as S+. It is evident that V, = S2, 

Example: Given the numbers: 4, 4, 6, 6. 6, and 10, we found the mean to 
be 6. Transforming the original numbers by the formula, 


x= X— M, 
yields a new set which Consists of —2, —2, 0, 0, 0, and 4. Then 
d BB = (I (99-0 OF a D iod 
an 
Ax? 24 
Ld dr = 
Note that S, = JV, = 44 — 2. 
The formula used above, 
Ax 
duin 


is not generally convenient for computing. We can rewrite it and apply the 


Moments of Z, : 


rules of simplification to obtain a computing formula which is especially well 
suited to machine computation. 
The expression, 


can be written as 


It should be evident that, if we obtain a computing formula for the numerator, 
2(X — M), which is the sum of squares of deviations from the mean, the 


final computation of V, will require only division by N. A" : 
Since M, is a constant for any given set of numbers, simplification of the 


numerator proceeds, as follows: 
X(Y — Ma} = (X? — 2M,X + M?) = ZX? — 2M,ZX + NM; 
We can now substitute 2X/N for M; and obtain 


=X)? (2X)? 
axe — 22% zy 4 GD rei + 


Quy 
N 
ms on the right yields a computing 


Combining the second and third ter 
formula for X(X — M)", 


zn — Gm. (4.2) 
Keep in mind that formula (4.2) gives Sx? which must be divided by N 
to obtain V,. That is, (Ex? 
DAA = 
Zx? N 
V. =N N 


To compute the standard deviation, S,, one finds the square root of the 


variance, V... x n 
Example: For the numbers: 4, 4, 6, 6, 6, and os a ZX 240, 
and N = 6, Substituting these values in formula (4.2) y 


367 
—_—— = 24. 
240 6 


Then V, = 24/6 = 4 and S, = 2. 


Moments of Zx 


Consider next a transformation of x to Z+ by the formula, 


55 Moments 


In mathematical statistics, Z, is called a standardized variable. In psycho- 


: e » 
logical statistics, Z is usually called a standard score or simply a “ z-score. 
The first moment of Z, is written 


DZ 
P 
The second moment of Z, is 
AZ 
Y 
The third moment is 
ZZ 
=e 
The fourth moment is 
2 
Notice that the zeroth moment of Z,, 
AZ? 
N^ 
can be written 
X0) N 
N NTL 
and that the first moment ot Z.. 
ZZ, 
Pa 


can be written 


ZOIS) (Q/S)Zx — (1/S,)(0) 
N N = Ww =? 


Notice also that the Second moment of Zi 


zz 
— Ki 


N 
can be written 


A/S.) Ds) q Sax? 1 


THE INDEX OF SKEWNESS 


The third moment of Z, is called the index of. 
here by the symbol, S. (The symbol, B,, 
is widely employed.) 


skewness. Itwill be represented 
was suggested by Karl Pearson and 


"The student should not confuse S, the index of ske i d deviation 
of the variable Y. The Subscript, x, d Gani quib UP MR OdRA 


istinguishes the Standard deviation. 


Moments of Zz 73 
Example: Given the numbers 
4, 4, 6, 6, 6, and 10, 
for which M, = 6, V, = 4, and S, = 2, the numbers can be transformed to 


the deviations: 
—2, —2, 0, 0, 0, and 4. 


The deviations can be transformed to standard scores by the formula, 
X 
Sz 
The standard scores produced by the transformation are 
—1, —1, 0, 0, 0, and 2. 


Z: = 


Then 
EZ? = (—1} + (—1* + OF 0 +0 4+ 2 —6 
and 
Az; 6 — 1 
CUN 6. 


The formula for S employed above is seldom convenient for computing 
because the standard scores are usually troublesome to obtain. A computing 
formula can be derived by substituting 

(X — Mz) 
S. 
for Z, in ZZ3/N producing 
XX — My. 


S=— Ns? 
The numerator can be rewritten as 
Zu — 3X°M, + 3XM4 — M9 


and rewritten again as 
Xy — 3M, 2X? + 3Mi£X — NM$, 
and finally as 
EX? —3M,ZX? + 3NM3 — N 
The formula for the index of skewness now becomes 
EX? — 3M:ZX? + 2NM$ (4.3) 
S = 7 NS? . 
s more convenient for computing the index of 
formula, S = 2Z3/N, it still represents a 
lator even when N is only moderately large. 
available for computing the higher moments 
do the task with speed and accuracy. 


M3 = EX* — 3M,ZX* + 2NM3. 


Although formula (4.3) i 
skewness than is the defining 
forbidding task on a desk calcu 
Fortunately, programs are now 
on electronic machines, which can 


74 
THE INDEX OF KURTOSIS 


Moments 


The fourth moment of Z, is called the index of kurtosis. It will be 
represented by the letter, K. (Karl Pearson used f, for this value.) 
An example: The numbers 


4, 4, 6, 6, 6, and 10, 


were transformed to standard scores in the preceding section. The standard 
scores which were produced by the transformation are 


—1, —1, 0, 0, 0, and 2. 


Then 
ZZ% = (—1) + (—1) + 08 + 01 + 0! 4-2: = 18 
and 
£EZi 18 
Kex 


As is true of the index of skewness, the index of kurtosis is not, in general, 
easy to compute. The formula employed above can be rewritten, as follows: 


K — 222 _ A/S.) Zeus) ze — XX—MO 
N N N NS Nv? 
The simplification rules can then be applied to the numerator to produce 


ZQ* — AX*M, + 6X?M2 — AXM3 + M3) 
which is equal to 


2X4 — AM,ZX* + 6MZZX? — 4M3 EX + NM. 
After the last two terms are c 


! ombined, the final computing formula for the 
index of kurtosis is 


De 
eu 2X4 — 4M, 5X? i dmn — 3NM$. (4.4) 
z 


possible on an automatic desk calculator but can be 
er speed and accuracy on an electronic computer. 


Computation of K is 
done with much great 


The Descriptive Moments 


The student has Probably noticed that we have paid special attention tO 
the first moment of X, the second moment of x, the third moment of Zz, an 
the fourth moment 
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numbers will be considered first. Their properties with reference to a fre- 
quency distribution of those numbers will be considered second. At this 
point in our discussion, those logical properties having to do with error will 
be disregarded. Later on, in the discussion of random variation, logical 


properties having to do with error will be considered. 


PROPERTIES OF M. 


Listed below are five logical properties of the first moment, or mean, of X. 


1. Inspection of the formula, 


and the equation, 
EX = NM. 


Which is easily derived from the formula, reveals that the mean of a set of 
numbers is the constant value which results when the sum of the set is 
divided into N equal quantities. 

2. The formula, 


can be rewritten as 
1 l ls ix 

M; -w Twy +H Xs +H Xv 
If we let p — 1/N and substitute, the preceding statement becomes 
M, = pX, + PXs 4 pX + ss + PXN 
llows: The mean is a number constituted by 
combining equal proportions of the numbers in the set. Because the mean has 
this property, one might think of it as a representative value. (It should be 
Noted that p is a constant and that it is the proportion contributed to the 
value of the mean by each number in the set.) “ Representative,” as used 


here, does not mean typical or most frequently occurring. 


3. Given the numbers: 


and can be interpreted, as fo 


Xy Xo Xs XN, 
s no lower algebraic value than X, and no higher 


Where Y, < Xy and there i 
be shown that 


algebraic value than Xw, then it can 
X,« Ma < Xn. 
e mean will lie between the lowest 


When the numbers in a set vary in value, th i tweer 
be said to be an interjacent value. 


value and the highest. That is, the mean can 
(A proof will not be given here.) 
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Caution: The mean will not necessarily lie midway between the two 
extreme values on the scale of X and will not necessarily lie in the middle 
of the distribution of values, with as many values above it as below. 

4. We observe that 


for the reason that 


2(X — M,) = 0. 
That is, we showed that 


A(X — Mj = Xx — IM, = ZX — NM, = ZX — XX =0. 


Thus we can say that the sum of deviations from the mean is zero, which 
statement implies that the absolute value of the sum of negative deviations 
is equal to the absolute value of the sum of positive deviations. 

Caution: The fact that the sum of deviations from the mean is zero does 
not imply that for every positive deviation there is necessarily a corresponding 


negative deviation of equal absolute value, although such correspondence 1s 
possible. 


5. It can be shown that 


A(X — M, < X(X — ay 


when a # M,. In words, we can state that the sum of squares of deviations 
from the mean is less than the sum of squares of deviations from any other 


reference value. That the mean has this property can be proved, as follows: 
Consider the expression, 


XXX — cy; 
it can be viewed as bein 
for the value of the fu 
expression can be simp 


AX — cz = 


g a function of c. We may ask what value c must have 
nction to be a minimum for any set of numbers. The 
lified and rewritten as indicated below. 


A(X? — 2eX + c?) = EX? — EX + Ne 
= Ne? — 26D KX 4+ D2 


Ne — NEX + NEY? 
N 


(Nc — ZX} — (Sx + NZX? 
x i 


from which we see that 

they would be for an 

depends only on c. 
We know that 


if N, ZY, and Zy? 


are treated as constants, which 
y given set of numbers, 


then the value of the function 


(Ne — Ex} 20 
and that the value of the function will be a minimum when 


The Descriptive Moments 71 
(Nc — 2X)? = 0, 
in which case 


Ne — 2X —0 
and 
AX 
c= = Me 


Therefore, the value of the function, 


UX = oP 
N 


is a minimum only when c = Ms. 

Summary of the properties of the mean: 

1. The mean is a value which results when the sum of a set of numbers is 
divided into N equal parts. 

2. The mean is a representative value, constituted by combining equal 
proportions of all N numbers in a set. (It is not necessarily a typical value or 
the most-frequently-occurring value.) 

3. The mean is an interjacent value, 
à set of numbers. 

4. The sum of deviations from the mean is zero. 

5. The sum of squares of deviations from the mean is smaller than the 
sum of squares of deviations from any other reference value. 


with respect to the extreme values in 


PROPERTIES OF V. 


Two properties of the second moment of x, or the variance, are presented 
next. 

l. Let (XY, — X) be the difference be 
number in the set including itself. The m 


tween a particular number and any 
ean of these N differences, 


is equal to 


ZX,— + ZX NX, zii = X, —M, =x}. 


Thus a particular deviation, x,, can be said to be a function of the differences 


between X, and the N numbers in the set. That is, 
3(X, — 2X. 
XB Nm 


Now the variance is a function of deviations from the mean as indicated by 


the formula, ze 
x 


E. N 
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Since each deviation is a function of a set of differences and the variance is = 

function of all of the deviations, the variance can be said to be a A sd 

the differences among the numbers themselves. That is, the magnitude of th 

variance depends on the magnitude of the differences among the numbers. 
2. Recall that the value of the function, 


A(X — oy, 


is a minimum for any set of numbers when c= M,. It follows that the a 
moment of (X — c) is a minimum when c — M, that is, when the secon 
moment is 


AX—My xe 
-N CON 
Which is the variance as we have defined it. Thus the variance is the minimum 


value of the second moment of (X — c) where c is a constant that is to be 
subtracted from each value of Y. 


PROPERTIES OF S 


The logical properties of the index of skewness will be discussed next. The 
index of skewness has been defined as 


— 222, 
CUN 


S 


, ; ix in the 
Note that S is a pure or abstract number in that it is not expressed in th 
units of the original scale of b 


> as are M, and V, The units of the original 
scale cancel in 


zs 


and the units of the new scal 
is said to be dimensionless, 
Recall that 2x/N = 
always equal to zero, i 
deviations is equal to the 
Recall also that 2Z,/N 
case the absolute value 
absolute value of the s 
moments, however, does 
of absolute value equal to 
although that situation could 


€ of Z, are actually standard deviations. Thus $ 
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could happen that positive cubes would cancel negative cubes completely in 
ZZ}, resulting in a value of S equal to zero. This cancelling will not neces- 
sarily take place in every case as it does with 2x/N and ZZN. 

One situation in which the cancelling is complete and S is equal to zero 
is that in which there is a positive Z, of absolute value equal to the absolute 
value of each negative Z,. There are other situations in which this correspond- 
ence does not obtain but the cancelling is complete and S is equal to zero. 

When the absolute value of the sum of the positive values of Z3 exceeds 
the absolute value of the sum of the negative values of Z3, then S is positive. 
That is, S > 0. When the absolute value of the sum of positive values of Zł 
is less than the absolute value of the sum of negative values of Zł, then S 
is negative. That is, S < 0. There is, in general, no upper or lower limit on 
the value of S. 


PROPERTIES OF K 


The logical properties of the index of kurtosis will be considered next. The 
index of kurtosis has been defined as 


gas Iz 
N 
pure or abstract number. It can 


As is true of S, K is a dimensionless value, a 


be shown that 
KZS +l. 


alue, K = 1, can occur only when 


It should be apparent that the lowest v 
be shown that 


S = 0. K has no upper limit. It can also 
g=% +i 
f Z, and that 


only when there are two distinct values o 
K=1 
= 1) only when the two distinct values 


(which means that 2Z!/N = 2Z:/N 1 
se the two values occur with equal 


of Z, are +1 and —1, in which ca 


frequency in the set of numbers. i 
Given a set of numbers in which there is a positive Z, of absolute value 


equal to the absolute value of each negative Z,, in which case S = 0, 


K>1 


greater than +1 and the number of values of 


if the number of values of Zs 
less than the number of values of Z, between 


Z, less than —1 are, together, 
—land +1. 
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Frequency Distributions 


The numbers in a given set can be classified according to their values HE 
the results of this classifying can be conveniently summarized in aee f 
distribution which represents frequency as a function of the magnitude o i ^ 

Consider a set of M numbers in which the distinct or different values ar 


Xo Xs Xs... X, 


1 ? is 
and fı is the number of Xy's, fy is the number of X,'s, and so x 
customary to say, in general, that f is the “ frequency of X." Note t 


Aithtfe+...+f, = Sf=N. 
The frequency distribution of X is shown in Table 4.1. 


Table 4.1 
FREQUENCY DISTRIBUTION FOR A SET OF N NUMBERS 
x f 
X A 
Xa f: 
P: h 
E, f. 


r = number of classes. 


f = frequency of X in each class. 
af = N. 


Notice that the notation has been alter 


one number in the set. Here, in the alter 
a class of 


One possible graphical re 
a histogram. Ina histogram 


Table 4.2 presents a frequenc 
which range in value from 1 to 


in graphical form the frequenc 


Y distribution for a set of N — 48 num 
5. Figure 4.1 is a histogram which represen 
y distribution of Table 4.2. 
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Table 4.2 
A FREQUENCY DISTRIBUTION 
x f 
1 1 
2 18 
3 12 
4 14 
5 3 
N 2 Zf — 48. 
30 
25 
20 
fis 
10} 
É 
gr tst 3 5 6 
x 


Fig. 4.1 A histogram representing the frequency distribution of Table 4.2. 


Frequency Distributions and the Descriptive Moments 


can be written to take advantage 


Formulas for the four descriptive moments 
y distribution. The sum of the 


of the classification provided by a frequenc 
N numbers in a frequency distribution is 
fix fos + f3X3 + ++: + SX e 
Therefore, the formula for the mean can be written 
r 
2fX, 


Mz N 


The sum of squares of deviations for the N numbers is 


fel Ade pfa + ++: + SXF 
and the variance can be written as 
Efe 
N 


Ve = 
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This principle can be extended to the index of skewness to yield 


Xfzà 
m m 
and to the index of kurtosis to yield 
Xfzi 
k= 


LOCATION OF A DISTRIBUTION 


With respect to a frequency distribution or its histogram, the mean is a 
measure of location. If we fixed or held constant all other moments of a 
distribution but allowed the mean to vary, the size and the form of the distri- 
bution would not change but the histogram would move to the right as the 
mean increased in value and to the left when the mean decreased in value. 
Therefore, we say that the mean indicates the location of the distribution. 
Figure 4.2 shows two histograms based on distributions with equal total 
frequencies, variances, indices of skewness, and indices of kurtosis. Only the 


means of the two distributions differ, a circumstance which places the two 
histograms in different positions on the scale. 


+ + 


H 

H 

d 

H 

i 

H 

H H 

i H 
=] TL. 

pe " — 

x 


Fig. 42 Two histograms based on distributions wi 
frequencies, variances, indices of skewnes: 


with dotted lines represents the distributi 


th unequal means, but equal tola! 
s and indices of kurtosis, The histogram drav 
on with the larger mean. 


DISPERSION OF A DISTRIBUTION 
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— 


butions with unequal variances, but equal total 


Fig.4.3 Two histograms based on distri 
d indices of kurtosis. Histogram A has a larger 


frequencies, means, indices of skewness, an 
variance than histogram B. 


SKEWNESS OF A DISTRIBUTION 


S has been called an index of skewness. Skewness refers to the symmetry 
or asymmetry of a distribution or its histogram. A distribution which is 
symmetrical about the mean is called a symmetrical distribution. This implies 
that the histogram is to be divided into two parts by erecting a perpendicular 
at a point on the horizontal axis corresponding to the value of the mean. In 


a symmetrical figure, for any point in the part of the histogram above the 


mean, there is a corresponding point in the part below the mean. These 


corresponding points have the same ordinate and the absolute values of 
their abscissas are equal when the Y's have been transformed to deviations 


from the mean. 


_ When the distribution is symmetrical, S nS ; | 
is asymmetrical. When S < 0, we say the distribution 1s negatively skew. 


When S > 0, we say the distribution is positively skew. . 
In Figure 4.4 are two histograms for distributions with equal total 


tributions with equal total frequencies, means, 
al indices of skewness. The index of skewness 


= 0. When S + 0, a distribution 


Fig. 4.4 Histograms representing two dis 
Variances, and indices of kurtosis, but unequ 
for histogram A is 0.25; for B, it is 0.50. 
frequencies, equal means, equal variances, and equal indices of kurtosis, but 


with unequal positive indices of skewness. eee . 
Caution: We have said above that, when a distribution is symmetrical, 
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S = 0, and that, when S + 0, a distribution is asymmetrical. One cannot say, 
in general, that, when S = 0, a distribution is symmetrical. One can say that 
when all the odd moments of Z, are zero, the distribution is symmetrical, but 
S, the third moment of Z,, can be zero while one or more of the higher odd 
moments are not zero. Figure 4.5 presents two histograms with equal total 


. Fig. 4.5 Histograms for distributions with equal total frequencies, means, variances, 
indices of skewness, and indices of kurtosis. S = 0 and K = 2.6 for both distributions. 


Histogram A is symmetrical, but B is not. They differ with respect to their fifth moments. 
The fifth moment of Z, for A is zero; for B, it is 0.50. 


frequencies, equal means, equal variances, and equal indices of kurtosis, and 
with indices of skewness equal to zero. Histogram A is symmetrical; B is not. 


KURTOSIS OF A DISTRIBUTION 


The index of kurtosis is said to reflect the flatness or peakedness of a 
histogram, but the appropriateness of these terms is not readily apparent 1n 
the case of distributions with extreme values of K. The index is also said to 
reflect the concentration of frequencies at or near the center of the distribu- 
tion, relative to the concentration in the tails. Figure 4.6 shows a discrete 


—— 
-2 -1.5 -1 -0.5 0 405 41415 42 
Z 


Fig. 4.6 A frequency distribution of Z, for which S = 0 and K = 1. 


distribution of Z: for which K = 1, the lower limit of the index. Notice the 
equal concentrations of frequencies at —1 and +1. In Figure 4.7 are two 
histograms which differ only with respect to kurtosis. Notice the difference 


between the two histograms with respect to the relative concentrations of 


e 
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d ——————— —t— | $—$+$ ++ +t 
x x 


ributions which differ only with respect to their indices 


Fig. 4.7 Histograms for two dist 
5; the same moment for B is 3.5. For both histograms 


of kurtosis. K, for histogram A, is 1. 
S is zero. 


| and in the intervals at the extremes. 


We have seen how moments can be computed from a frequency distribu- 
tion. It is possible to reverse this process. There is a determining relation 
between the moments of a distribution and the frequencies in that distribu- 
tion. It is in the nature of the relation that one can choose values for the 
moments and then compute the relative frequencies determined by the chosen 
values. (The computational procedure, which involves the solution of a set 
of simultaneous equations, will not be given here.) The number of moments 
required to fix the distribution depends on the number of distinct intervals 
in the range of the entire distribution. The zeroth moment can be counted as 
One of the required number. For example, five moments—the zeroth, first, 
second, third, and fourth—would be required to determine the frequency 
distribution of a set of numbers in which there occurred the five different 
values: 1, 2, 3, 4, and 5. The frequency distribution of a set of numbers 
whose range covered six intervals on the scale of X is not fixed by five 
moments. In Figure 4.5 are two six-interval histograms with equal total 
frequencies, means, variances, indices of skewness, and indices of kurtosis. 
The two histograms differ with respect to their fifth moments. Histogram A 
has a fifth moment of Z, equal to zero; the same moment for B is 0.50. Both 
histograms have values of S equal to zero, but A is symmetrical and B is 


Obviously not symmetrical. 


frequencies in the middle interva 


Other Common Transformations 


The numbers in a set may be transformed by the addition of a constant 
to each number, the subtraction of a constant from each number, the 
multiplication of each number by a constant, or the division of each number 
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by a constant. The result of one of these operations, ora combination of 
them, is a new set of numbers. It will be instructive to examine the four 
descriptive moments of the new set in relation to those of the old set. When 
numbers are transformed in this way they are often said to be coded. These 
transformations are widely used and are of considerable importance. 


First, we shall describe the consequences of adding a constant to each 
number in a set. 


Let X be any number in the original set of N numbers and let a be a 
constant. 


Let X' be a value computed from X by the formula: 
EFFET? 

Then X' is any number in a new set of N numbers. 

Let M. be the mean of the new set. Then 

2X 

O N 

We can substitute (X + a) for X’ and obtain 


AXX + a) 2X + Na 
My = N N — = M. + a. 


My 


Conclusion: Coding by adding a constant adds the constant to the mean. 


By similar methods we could show that coding by subtracting a constant 
subtracts the constant from the mean. 


Let V. be the variance of the new set. Then 


y, = 2X, ZO = Mey 
- N N 
We can substitute (X + a) for X' and (M, + a) for M» to obtain 
y, — ZIX + a) — (M, + a) 
at N m p 


which can be rewritten as 


A(X — M, 
B=. = 
N x 
since the constants in the numerator cancel. 


Conclusion: Coding by adding (or subtracting) a constant does not change 
the variance. Nor would the standard deviation be affected. 


Let S’ be the index of skewness of the new set. Then 


gi a ZZ _ Ux"/s2) mx? 
N N ~ 


Examine the numerator, Zx”, By definition, 


NS3, 
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x —(X' — M). 
We can substitute (X + a) for X’ and (M. + a) for Mx, to obtain 
x' =(X +a) — (Mz +4) =X — M. =. 

We have already found that Sy = S,, that is, coding by adding or sub- 
tracting a constant does not affect the standard deviation. Therefore, if we 
substitute x for x’ and S, for Sy, we find that 

ex? 


er 


Conclusion: Coding by adding (or subtracting) a constant does not affect 


the index of skewness. 
Let K' be the index of kurtosis for the new set. Then 
, Zzb. Hafn D 
Ki=—y ON NS 


but we have determined previously that x’ = x and S, = S,. Therefore, 


Conclusion: Coding by adding (or subtracting) a constant does not affect 


the index of kurtosis. 
Next we shall consider the consequences of multi 
number in a set by a constant. 
Let X' be a value computed from X 
X' — aX. 


plying (or dividing) each 


by the formula: 


Consider the mean of X’. By definition, 
E. 
My ="w 


We can substitute aX for X and obtain 
Zax | aXX 
ies me 
by a constant multiplies the mean by the 


essed as equivalent multiplication, we 
tant divides the mean by the constant. 


Conclusion: Coding by multiplying 
Constant. Since division can be expri 
also conclude that dividing by a cons 

The variance of X' is, by definition, 

w" XX = My) | 
Ve = N N 
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ituti i Mx» yields 
Substitution of aX for X' and aM, for S , 
A(aX — aM, Zax — M.) a XX — Mj _ 
Ve N T N N 
Conclusion: Multiplying by a constant multiplies the variance by the square 


of the constant. Dividing by a constant divides the variance by the square 
of the constant. 


Note that if 


a*V,. 


Vs = P 
then 


We = S, = JatV, = aJV, = aS,. 


Conclusion: Multiplying by a constant multiplies the standard deviation by 


a constant. Dividing by a constant divides the standard deviation by a 
constant. 


Consider next the index of skewness for the new set of numbers. We have 
already observed that 


" Ax 
~ NS& 
Let us examine x' in the numerator, Dx”. By definition, 
ges X^ ws 
and, after substitution, 
x —aX — aM, = a(X — M,) = ax. 
Then 


2x3 č ax 


NS} ^ NSÀi' 
Recall that S, = aS,. After substituting, we find that 


Ss 


, a? Xx? 2x3 

S = waist = sa = S- 
Conclusion: Since S' — S, 
affect the index of skewness. 
Finally, let us consider K’, the index of kurtosis for the new set of numbers. 

We have already established that 
; axe 
K= NV2, 


coding by multiplying (or dividing) does not 


, 


that x' — ax, and that Ky 


= a*V,. We can substitute ax and a?V, in the 
formula for K' and obtain 


K Za'xi gx _ Zx 
Nat ~ NdVi— NV = 


Conclusion: Since K' = K, coding by multiplying (or dividing) by a constant 
does not change the index of kurtosis, 
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The effects of coding on the four descriptive moments and the standard 
deviation are summarized in Table 4.3. 


Table 4.3 


EFFECTS OF CODING OPERATIONS 
ON THE DESCRIPTIVE MOMENTS 


Operation 


X+a aX X/a 


Moment Sy 


e are often employed in computing. 
f computing means, variances, and 
re or class intervals in a 


The coding operations described abov 
Traditional paper-and-pencil methods o 
standard deviations for numbers classified into sco 
frequency distribution have involved the following steps: 

1. One chooses, as a new arbitrary origin, the midpoint of an interval, 
usually but not necessarily near the center of the distribution. (This new, 
arbitrary origin is often called an " assumed mean " or a “ guessed mean.") 

2. A “zero” is assigned to the interval containing the new origin. Then, 
Working in both positive and negative directions from this interval, one 
assigns small whole numbers, with appropriate signs, to all other intervals. 

3. The resulting, new scale values are employed in the computation of a 
mean, a variance, and a standard deviation. 

Implicit in this procedure are the coding operations, subtracting a constant 
and dividing by a constant. The constant which is subtracted is the midpoint 
Of the interval containing the assumed mean. The constant divisor is the size 
Of the class interval. Moments computed from the coded values must, of 
Course, be uncoded by the reverse operations. The variance must be multiplied 
by the square of the constant divisor; the mean must be multiplied by the 
Constant divisor and that product must then be added to the constant which 
Was subtracted. 

Of more interest to us here than these paper-and-pencil methods are 
Computational procedures for an automatic, desk calculator. It is often 
Convenient, when one is working with large numbers, to subtract a constant 
from each as the numbers are entered in the calculator. For example, if one 
had a set of numbers ranging in value from 105 to 181, it would be a simple 
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matter to drop the hundreds digit and compute with numbers ranging from 
5 to 81. It is also convenient sometimes to ignore decimal points. Suppose 
the original numbers had values from 0.23 to 1.91. One could drop the 
decimal point, an action which is equivalent to multiplying each number by 
100, and compute with numbers running from 23 to 191. Whenever numbers 
are coded to facilitate computation, one must, of course, take account of the 


effects of coding and, when it is necessary, uncode computed values to obtain 
final answers. 


Special Transformations 


We shall mention and describe only briefly five transformations which are 
of some considerable importance in later work in statistics. We shall not 
attempt to show their specific effects on all of the descriptive moments. 

l. Squaring. Let X represent any number in an original set of numbers. 


Let Y represent any number in a new set and, further, let Y be defined by the 
formula: 


Y= ys 
Example. Consider the original set of numbers: 
1, 2, 3, 4, and 5. 
Squaring each produces a new set consisting of the numbers: 
1, 4, 9, 16, and 25. 


Notice that the original set is symmetrical (S = 0) and that the`new set is 
not symmetrical (S > 0). 


2. Extracting the square root. Let Y be defined by the transformation 
formula: 


Y = Jx. 
Example. Let the original set consist of the numbers: 
4, 9, 16, 25, and 36. 
Applying a square-root transformation produces the numbers: 
2, 3, 4, 5, and 6. 


The distribution of the original set is not symmetrical; that is, S > 0. The 
new distribution is symmetrical with S = 0, 


3. Transforming to logarithms. Let Y be defined by the formula: 


Y = log. X. 
Example. Given the set of numbers: 


0.032, 0.320, and 3.200, 
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we can rewrite these three values as 
0.320 x 10-1, 0.320 x 10°, and 0.320 x 101. 


We might find it convenient to code these numbers by dividing each by 0.320. 
After coding, the numbers would be 


10-1, 10°, and 10!. 


The logarithmic transformation, applied to the coded values, would then 
yield 
—], 0, and +1. 


We would undoubtedly find it more convenient to work with the transformed 
values than with the original values. Notice that the distribution of the 
transformed values is symmetrical whereas the original distribution was 


positively skew. 
4. The inverse sine transformation. Let p be any number in a set such that 
Ospsil. 
A transformation of p to 0, an angular measurement, is given by the formula: 
6 = sin Jp. 
The formula can be read: “ Theta is an angle whose sine is equal to the 
Square root of p." The transformation is effected by taking each value of p, 
obtaining its square root, and referring the root to a table of sines for the 
Corresponding angle. 
Example. Let the original values of p consist of the numbers: 
0.03, 0.12, 0.25, 0.41, and 0.59. 


The angles whose sines are the square roots of these original values are, 


approximately, — E 
10°, 20°, > > . 


The original values are positively skew. The transformed values are 


Symmetrical. 

5. The reciprocal transformation. 
9f numbers, Then the reciprocal trans 
Y = I/X. 
he asymmetry of the distribution of the 


Let Y be any number in an original set 
formation to Y is given by the formula: 


Example, It is evident that t 


Numbers: 
1, 1/2, 1/3, 1/4, and 1/5, 


is eliminated when the values are transformed to their reciprocals, which are 


1, 2, 3, 4, and 5. 
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Substitution of Ranks 


It is possible to substitute ranks for the original values in a set of numbers. 
We shall consider this substitution only for those sets of numbers for which 
the resulting ranks contain no ties. When there are no ties, the mean and the 
variance of the ranks can be expressed as a function of N. If N numbers are 
ranked without ties, the sum of the N ranks is the sum of the first N positive 
integers. That is, the sum of ranks is given by the formula, 


N(N 4-1 
ER, = NO +1) 
2 
where R, is the rank assigned to X. Then the mean of ranks is 
N+1 
M, co 


The sum of squares of the N ranks is given by the formula: 
2N 1 
zao MU E DON + D, 
The variance is 


2 (ZR) 
ZR, — S 


VS N 
which, after substitution and appropriate changes, becomes 
N? — 1 
Po mi 
7 12 


Example. Given the numbers: 


2, 5, 6, 10, and 11, 
the corresponding ranks are 
5, 4, 3, 2, and 1. 
— 5 and the sum of ranks is 


5G +1) 
(3 = 15 


In this example, N 


The mean is 15/5 = 3, 
The sum of squares of the ranks is 


56 + DQ x 5 +41) 
> = 


55 
and the variance is 
5—1 TT 
I2 ~ 
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Summary 


The discussion in the present chapter has consisted of a syntactical 
treatment of a class of values known as moments. The syntactical treatment 
dealt with the definitions of moments and the logical properties of the four 
descriptive moments—not including any properties which have to do with 
error. We shall leave syntactics for a time, but shall return to it in Chapter 7 
where another class of values called product moments will be introduced. 
Having laid a groundwork of symbolism in Chapters 2, 3, and 4, we have 
chosen to take up, in Chapter 5, one of the major semantical problems— 
measurement. After we have discussed the nature of confirmation in psycho- 
logical measurement, we will be ready to consider, in Chapter 6, the first 
topic in pragmatics: the use of moments in describing data. 


Table 4.4 
FIVE SETS OF JOINTLY CLASSIFIED NUMBERS 


Columns 

E w X y z 

1 11 42 3 12 101 
2. 86 122 2 10 146 
3 39 77 3 9 129 
4 91 88 2. 8 151 
5 16 62 4 13 136 
6 73 115 5 14 223 
7 30 45 4 12 150 
8 69 61 2 7 129 
9 27 57 2. 10 87 
10 72 103 2 6 132 
11 37 57 4 15 157 
12 35 39 4 10 155 
Rows 13 65 80 2 8 125 
14 71 74 2 6 131 
15 59 65 2 8 119 
16 82 103 2 8 172 
17 99 130 3 10 189 
18 11 42 3 11 101 
19 7 30 3 7 97 
20 24 62 4 14 144 
21 32 56 4 13 152 
22 88 107 4 13 208 
23 24 T 1 3 54 
24 19 29 4 10 139 
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Compute M, and V,, M, and Ve, M, and V,, M, and V,; in other words, 
compute the first and second descriptive moments for the variables: V, W, Y, and 
Z in Table 4.4. For variable X, compute all four descriptive moments: Mz, V 
S, and K. 

Code variable X by adding “ one " and multiplying by “ two." Then, for the 
coded values, compute the four descriptive moments: My, Vx, S’, and K’. Compare 
the moments for X^ with those for X. Could you have predicted what they would be? 


Example of the use of computing formulas for the mean and variance of variable V. 
Formula (4.1) for the mean is 


ZV 
M, =N" 
ZV = 1167 and N = 24, Then 
1167 
My = 24 ^ 48.625. 
Formula (4.2) for the sum of squares of deviations is 
(zyy 
E. ETAT. 
zy N 
ZV = 1167; ZV? = 76,795; N = 24. Then the sum of squares of deviations is 
76,795 — (1677 = 20,049.625 
and the variance is 
20,049.625 
Vy = g = 835.40. 


Exercise 4.2 


For each of the eight frequency distributions given in Table 4.5, compute the 
four descriptive moments: M,, V,, S, and K. You will find that the numbers are 
simple to work with and that an automatic desk calculator is not necessary. 


Table 4.5 
EIGHT FREQUENCY DISTRIBUTIONS 

(1) Q) (3) (4) 
x f x £f bd f X d 
1 1 1 1 1 0 1 0 
2 0 2 2 2 10 2 11 
3 6 3 6 3 0 3 3 
4 0 4 2 4 10 4 9 
5 1 5 1 5 0 5 1 

(5) (6) (T (8 
x A x f x i # x f 
1 0 1 2 1 1 1 0 
2 2 2 3 2 14 2 3 
3 3 3 15 3 24 3 3 
4 0 4 1 4 2 4 1 
5 1 5 3 5 7 5 H 
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Obtain graph paper and draw a histogram for each frequency distribution. Enter 


beside each histogram the values you obtain for the 


four descriptive moments. 


Study the differences among the histograms and the corresponding differences 


among the moments. 


Example of computation for distribution (1) of Table 4.5. 


Xf—N--8 
XfX = M) + 0Q) + 6) + 0(4) + 1G) = 24 
Py 
M, = HX = a3 
The frequency distribution of x is 
x f 
=o 1 
=| 0 
0 6 
+1 0 
+2 ! 
Sfx? = 1(—2)? + 0(—D* + 60)? + o? + 12)? = 8 
Set 8. 
V, = NU - = 1 


S, =V Va =y1 =1 
Since Z, = x/S,, the frequency distribution of Zz is 


+1 
+2 
zfz3 = (29° + o(— D° + 60) + OCD? + 107 
xz 0. 
s- %37. 
22s = 1(—2)' + -D* + 60% + 00) t 12)! 
32 _ 2 4 


KH ODOEN 


=0 


= 32 
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The totality of semantical issues and problems confronting the research 
psychologist is, without doubt, awesome. It would be ill-advised, to say the 
least, to attempt the resolution of all of these issues in the two chapters which 
are devoted to them in this text. Our goal is a modest one. In the present 
chapter there will be undertaken a preliminary and limited exploration of two 
specific questions about measurement. These questions are of concern to 
Tesearch psychologists and have special relevance to the study of statistical 
methods in psychological research. In Chapter 13, the problem of confirma- 
tion as it arises in statistical inference will be taken up. N 

The two questions to be considered in the present chapter are the following: 
(1) What is the nature of confirmation and disconfirmation with respect to 
the semantical designations of psychological measurement? (2) How do ideas, 
plans, and specifications for the preliminary tasks, operations, and materials 
originate in the initial period of test or scale development before confirmation 
is sought? . 

There is considerable agreement among psychologists regarding certain 
notions about measurement. Numbers have properties. Behavioral events and 


products have properties, too. When there is some kind or degree of corres- 
pondence between these two sets o 


to assign the numb 
pondence between 

inevitably, a fruitle 
employed to refer t 


Pondence does or does not obtain in 
Some given instance of attempted measurement. 


When one goes beyond these basic notio 
logical measurement and i 
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logists and others have had about measurement and the requirements of 
measurement. It does seem reasonable and appropriate to describe here two 
approaches to the problem, both of which figure, in varying forms and to 
varying degrees, in current discussions of measurement.! 


The Structural Approach 


If one takes for granted the desirability of correspondence between the 
properties of numbers and the properties of behavior, it is tempting to begin 
the quest for confirmation with a logical analysis of the properties of numbers. 
Having produced a list of the logically determined properties of numbers, one 
might then require a point-by-point demonstration of correspondence be- 
tween these properties and the observable properties of the behavior. 
* Demonstration” and “ observable” are key words here; the former 
implies that some kinds of operations, in which the objects or events being 
measured have a central role, must be performed by the investigator; the 
latter has the meaning one would expect an empirical scientist to give to it. 
For the empirical scientist, in general, measurement is quantified observation; 
it is his experience of the objects and events of the world, translated into the 
language of numbers. For the psychologist, measurement is the observation 
of behavior, given expression in the language of numbers. 

If a point-by-point determination of the correspondence of the two sets of 
Properties is necessary, how does one £o about staging the demonstration? 
Let us consider an example involving a familiar kind of physical measure- 
ment, the measurement of length. Everyone is acquainted with the fact that 
numbers can be incorporated in statements like those listed below. 


l. X=Y 
2 X>Y 
3. X, — XS Ho 
4. X, X, = AY: : 
Let these ae statements constitute our representation of at least some of 


the properties of numbers. So much for the numbers. 

For the other half of the example, we shall use ropes of varying lengths, 
chosen to suit our purposes. Now two ropes, A and B, have the property, 
A = B, when they are of the same length. That they are of the same length 
Can be determined, without reference to or use of numbers, by placing them 


horitative account of the views of particular 
he issues are SO complex, it is probable that the views of any 
curately only by that person. What follows is an attempt 
ositions. The student may, of course, readily identify 
scussion as having derived from the work of persons 


"No attempt will be made here to give an aut 


individuals. In an area where t 
Particular person could be given ac 
to formulate two representative P 
Certain elements of the present di 
Other than the author. 
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side by side, stretching them reasonably taut, and observing that T ce 
of one coincide approximately with the ends of the other. In the same a of 
situation, they have the property, A > B, when A is longer than B, mde 
course, is the case when one observes that the ropes coincide at one en : 
not at the other. Given four ropes—A, B, C, and D, they have the proper E 
A — B = C — D, when the amount by which A extends beyond B can A 
shown to be equal to the amount by which C extends beyond D. m 
that the set of relations can be demonstrated by arranging and observing th 
ropes. The demonstration would not require the use of numbers n any Ps 
To demonstrate the relation, AB = CD, requires a somewhat different € 
of operations. We would need a flat surface on which to mark off a Se att 
area using rope A for the one dimension and rope B for the other. We wo sh 
also mark off a rectangular area using ropes C and D. By subdividing bus 
area into some arbitrary unit area we could establish that the two rectang 
enclosed the same amount of space. . Wesen 
The structural approach to confirmation does not require that, in € S 
case, the demonstration of relations among properties of objects or -— 
must necessarily be reduced to the specific operations with ropes which Be "i 
described above. It is conceivable, although not necessarily true, that AM 
are many other sets of operations which bear little or no resemblance to m 
physical manipulation of ropes, but which would be appropriate for 
demonstrations. ‘hat 
What does seem to be clearly required by the structural approach is ee 
€ operations employed in demonstrating the properties of objects, ie 
distinguished from the properties of numbers, are of a primitive type whic 
does not involve the use of numbers and which does involve observation 
and the overt management of the observer. These primitive operations have 
the status of a criterion in that they constitute a procedure for confirming OF 
disconfirming the designation. A 
Our reason for emphasizing the primitive character of the cone 
operations and the important role of observation and management Is Los 
these are the features which guarantee the presence of the designatum In the 
evaluation of the sign-designatum relation. If one wished to evaluate ue 
method of psychological measurement and if there was not an obvious y 
appropriate set of primitive operations available, one could make the mistake 
of confining his attention to the numbers themselves and forgetting the 
behavioral events. The numbers, of course, would prove to have the pro- 
perties that numbers have. The mistake consists of leaving out of considera- 
tion the system of behavioral Properties. One would have to be very cleat 


å : e 
and certain that both sets of Properties had actually figured in th 
demonstration. 


th 


There are other pitfalls one must avoid in 


taking the structural approach. 
It is a mistake to try to find a substitute for 


the primitive operations, when 
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they elude one's search. An improper substitution, to which psychologists 
have been known to resort, has its origin in a misinterpretation of the role of 
observation in measurement and in science. Many psychologists are familiar 
with and agree with some form or variation of the statement: Measurement 
always involves a human observer. It is easy to rearrange this sentence into 
one which may seem to be logically defensible, but certainly is not: The action 
of a human observer always results in measurement. There is an important 
area of psychological measurement which has to do with the judging of 
physical and behavioral stimuli; it is called psychophysics. In this area, it is 
understandable that, in the absence of an obviously relevant set of primitive 
operations, the judgments themselves might be mistakenly interpreted or 
viewed as constituting these operations. This amounts to rewriting the 
sentence about measurement and the human observer, given above, into the 
sentence: The judging of any attribute in a quantitative fashion always con- 
Stitutes measurement. The concrete, manipulative operations which should 
be applied to the objects being measured have disappeared from the 
evaluation scene and the focus of attention is now on the act of observation. 

To maintain that observing the relation between two ropes placed side by 
Side is fundamentally the same as judging the pitch of two different tones 
presented successively, or judging the sourness of acid solutions of varying 
Concentrations, is to deemphasize the role of the primitive operations in 
determining correspondence and to shift the burden of confirmation else- 
Where. Perhaps that is just what should be done, but, if it is, we should 
recognize and acknowledge the fact. It may be that the important factor in 
confirmation is not the set of primitive operations, but is, rather, the quality 
of the observation itself. We shall not Say what we mean by quality at 
this point or how it might be evaluated, since much of what will follow 
Concerning the second approach will deal with these issues. JA 

There is another kind of inappropriate substitute for the primitive 
Operations—a-logical analysis of the behavior in question. To try to solve set 
Problem of confirmation by 4 logical analysis of numbers and a logica 
analysis of the behavior is to try to convert the problem from an T 
One to a theoretical one, oF from a semantical problem to a syntactica 
Problem. One might mistakenly resort to axioms, to a Lange E Pl 
theory—perhaps theory of a mathematical sort—in oe o bd 
measures with the desired attributes E ampi parpi sei 
them. The syntax of the language 1n wh à ; 
and the Tete. malls of behavior are reported might F aa ome 
would be devoid of empirical substance. The semantical ques 


confirmation would remain unanswered. 
Before leaving the structural approach, 
disturbing feature it possesses. It provides n 
sequences of failures in correspondence. The stru! 
a logical analysis of the properties of numbers. T 


we shall point to one more 
o way of assessing the con- 
ctural approach begins with 
here is hardly any limit one 
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can place on the number of abstract properties which might result from an 
exhaustive analysis. The analysis might cause our attention to be focused on 
properties which were irrelevant or superfluous to confirmation. What we 
need is some way of assessing the consequences of a failure in correspondence 
with respect to any given property, thereby assuring ourselves of its relevance. 
We could then establish a class of properties for which failure in corres- 
pondence had serious consequences and a class of properties for which 
failure had negligible consequences. 


To date, there is very little evidence of success in using primitive operations 
to confirm or disconfirm the designations of psychological measurement. To 
say there has been little success does not mean that there can be none, Or 
that there will be none. The possibility of breaking through this barrier in 
the future cannot be denied. In the meantime, however, psychologists who 
wish to take the structural approach to confirmation find themselves in à 
difficult and, perhaps, awkward situation. What happens if one wishes to seek 
confirmation for a given scale and cannot discover the primitive operations 
required in his case? Does he abandon his scale? Not likely, for to do so 
would mean, at this stage in the development of psychology, abandoning 
most, if not all, of psychological measurement. 


The Functional Approach 


There is a sense in which one might say the structural approach is 4 
molecular approach. That is, it requires an enumeration of individual pro- 
perties and the checking of correspondence property by property. The 
structural approach requires a logical decomposition of the general features 
of the number system into quite specific attributes. There is neither a priori 
reason nor evidence which compels us to believe that confirmation or dis- 
confirmation can only be achieved by means of a molecular approach. It may 
not be at all necessary to demonstrate point-by-point correspondence for à 
list of abstracted number Properties. It is entirely possible that an approach, 
in which attention is directed to the general attributes of sets of numbers 
à molar approach—can provide the evidence that is necessary and sufficient 
for confirmation or disconfirmation. 

If the structural approach is a mole 
is molar. Essentially, 
tion between two sets 


cular one, then the functional approach 
it requires examination of the correspondence or rela- 
of paired numbers in each one of three kinds of situa- 
tions. The three situations provide the basis for evaluating three important 
characteristics of a measurement variable: (1) its distribution, (2) its 
reproducibility, and (3) its predictiveness, 


By distribution we mean that, under some, if not all, circumstances: 
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measures of different objects or events should vary. That is to say, a given 
psychological test or scale, if it is to satisfy the distribution criterion, should, 
when it is administered to some group of subjects, produce varied scores. If 
that test or scale fails to produce variable scores, as it is applied to various 
groups of subjects, there is no evidence of confirmation with respect to 
distribution. There is an assumption here that on any single occasion people 
respond differently and that the test or scale should reflect these response 
differences. 
. À common way of exhibiting the variation in response, when it is detected, 
is to arrange the scores or measures in a frequency distribution which gives 
2 pairing of each different measure of response and its frequency of occurrence 
in the group of subjects. Thus the characteristics of the relation between 
frequency and the magnitude of some measurement variable would be taken 
account of in evaluating the measurement with respect to the first criterion— 


distribution. 
By reproducibility we mea 


Same object or event should not vary 
this second requirement calls for two sets of measures on the same variable 


and the same subjects. The two sets of measures are paired, of course, as a 
consequence of measuring each subject twice. Assuming that the distribution 
requirement is met for both sets of measures, then the reproducibility require- 
ment is met if there is a linear relation between the first set of measures and 
the second, and if the slope of the function is approximately unity. Saying 
there should be a linear relation does not imply that it should be perfect or 
that the plotted points must lie in a perfectly straight line. Some degree of 
irregularity can be tolerated. Just how this linear relation and the satis- 
factoriness of the regularity might be expressed will not be described here. 
It will be discussed in Chapter 9, which covers the descriptive uses of product 
moments. Notice that implicit in the evaluation is the ordering of subjects 


n that the results of repeated measurement of the 
much. In psychological measurement, 


Table 5.1 


TWO SETS OF MEASURES ON THE SAME SUBJECTS; 
i‘ TION OF REPRODUCIBILITY 


AN ILLUSTRA 
First Set Second Set 
1 17 18 
2 14 14 
3 25 26 
4 23 24 
5 11 12 
Subjects & 13 12 
7 24 24 
8 21 21 
9 15 16 
10 20 19 
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on each occasion and an examination of the equality or inequality of corres- 
ponding differences on the two sets of measures. If the relation is to be "o 
and the slope is to be unity, then the difference between two subjects on the 
first occasion must be equal to the difference between the same subjects on the 
second occasion. 

As an illustration of a situation in which the reproducibility requirement 
is satisfied, two sets of measures are given in Table 5.1 and the plot of the 
two sets is given in Figure 5.1. A straight line has been drawn through the 


30r 


Second Set 


FAY 
Vg 15 20 25 
First Set 


Fig. 5.1 


" FN. i d 
Plot of the two sets of measures in Table 5.1. Notice the distinct linear tren 
of the point 


s. The plot indicates that the reproducibility of the measures is quite good. 


; P " " T e 
configuration of points to show how close to linearity the relation is. Th 


position of the line was determined by inspection. Its slope is approximately 
unity. 


Table 5.2 


TWO SETS OF MEASURES ILLUSTRATING 
^ LACK OF REPRODUCIBILITY 


First Set Second Set 
1 15 19 
2 11 22 
3 13 14 
4 23 13 
Subjects : - i 
7 19 20 
8 10 17 
9 14 11 
10 


18 10 


ne Functional Appr 
] 
pproach 10 


As an ill i ituation i 

temet Mikel ne of a situation in which the reproducibility requirement 

i. Ie 5 a e sets of measures are given in Table 5.2. Their plot appears 
.2. No line has been drawn through the points because the 


25 
r 
* 
= 20 : 
s 
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~v 
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$ 
€ St . 
Pee ae 
106 15 20 25 


First Set 


Table 5.2. Ther 
that the measures l 


eis no pronounced trend in 


Fig. 5.2 
Plot of the two sets of measures in ducibil 
ack reproducil ility. 


the co; 
nfi 1 z 
guration of points. The plot indicates 


Table 5.3 
TWO VARIABLES; 


MEASURES ON 
REDICTIVE RELATION 


AN EXAMPLE OF A P 


Variables 
y r 
1 22 47 
2 16 35 
3 11 6 
4 27 38 
5 21 46 
6 30 25 
7 24 45 
8 29 30 
9 12 13 
210 18 a 
Subjects 11 26 f 
12 28 
13 13 20 
14 14 25 
15 23 46 
16 25 44 
17 20 45 
18 19 44 
ME X 
17 


20 
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irregularity and inconsistency of corresponding differences are extreme and 
there is no suggestion of a linear relation. : : 

The third requirement, in the functional approach to confirmation, is 
predictiveness. By predictiveness we mean that, given measures of the same 
objects or events on two different variables, the two sets of paired observations 
should be so related that, from a measure on one variable, one can predict 
the corresponding measure on the other variable. It is not required that the 
prediction be perfect and, consequently, one needs a standard for deciding 
when the requirement is met. The procedures commonly used for deciding 


the issue will be discussed in Chapter 9, which deals with the descriptive 
uses of product moments. 


In Table 5.3 are presented measures on twenty subjects on two ee 
variables, X and Y. The twenty pairs of measures are plotted in Figure 5.3. 


50 


40 s x 


cmm mm 
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Fig.5.3 Plot of the paired measures from Table 5.3. Notice the highly predictive relation 
between X and Y. 


Notice that the pattern of points is distinc 
diction from X to Y would be fairl 
curve which fits the configuration 


tly curved indicating that the Loe 
y accurate if one knew the equation for t 
- The pattern could, of course, take many 


+a Hy 
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Fig. 5.4 Plot of paired 


measures for var 
predictive relation between 


iable X and Y. There is no indication of 2 
the two variables, 
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A highly irregular 
d be indicative of a 
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other forms, i " 
pattern, aul pit iue that of a simple, linear trend 
failure in predicti he one displayed in Figure 5.4, Soil 
ftev: iveness for variable X with respect to variabl 

strong i een our experience that man d iain 
ther ntuitive appreciation of certai y students of psychology possess à 
quality or ine 4 L rtain scale properties which have to do with 
the relevance of Mie of intervals, but that relatively few students perceive 
sions by vague ref equa intervals is introduced into their 
employed in the references to the obvious equality of the standard units 
an uneasy refere measurement of physical length and is then dismissed with 
Probably not nee to the intervals on some psychological scale as being 
Panied by mis equal. This uncertain view of measurement is often accom- 
ie of feet about the consequences of using nonlinear transforma- 
Decause of the er such as square roots, logarithms, and reciprocals, 
inadequacy is he ect of these transformations on the scale intervals. An 
that ilis bata revealed in the student's thinking by his apparent belief 
Which was not t tion of ranks can introduce a kind of regularity into his data 

Implicit in s be found in the original measures. 

are many echo uncertain and sceptical view of psycholo. 
Investment ee which could, perhaps, be made explicit, but only with the 
Profitable, in a en deal of time and effort in the analysis. It might be un- 
a and con e end, to make such an investment in the analysis of vague 
eveloping a cerns. It could be more worthwhile to spend our time here in 
an inadequ n adequate approach to measurement than in trying to cope with 
ask about ate one. We propose that the only appropriate questions one can 
regularity ha intervals of a given scale are the following: Is there any 
s at scale bate discerned in the relation between observed differences on 
h It does e | observed differences on some other scale? Can this regularity, 
Quation ? xist, be expressed by means of a mathematical function or 


gical measurement 


y be difficult or impossible 
or irregularities) of 
other scale can be 
difficult. Furthermore, it is 
amine measures on one 
re isa familiar and 
dely applicable. 
dasa standard 
even be true 


cale mà 
e regularities ( 
Is of an 
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ever im and expressed. The procedure is not 
Scale as pmi That is to say, one cn always €X 
Standard unction of measures on another. The procedu 
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be considered a resort to a primitive type of expression of functional relations. 

There is one final point to be considered—the relation between the 
structural approach and the functional approach. As appealing as the struc- 
tural approach may be to a theoretically oriented research psychologist, he 
cannot afford to disregard the requirements of the functional approach. That 
is, even if he could achieve confirmation by demonstrating point-by-point 
correspondence, which as we have indicated is difficult, uncertain, and per- 
haps impossible, he could not take distribution, reproducibility, and pre- 
dictiveness for granted. He would surely be very much interested in studying 
these characteristics of his scale and in achieving them. On the other hand, 
the practical-minded research psychologist may turn his attention directly to 
the requirements of the functional approach, assuming that if he can meet 


those requirements, then the requirements of the structural approach are not 
binding on him. 


Summary 


The important semantical question as to the nature of confirmation is a 
difficult one. Two representative positions with respect to the question's 
answer have been described briefly and in somewhat general and elementary 
terms. It is probably apparent to the student that the present author takes à 
position supporting the functional approach. There is little doubt that discus- 
sion of these and other positions will continue for a long time before anything 


resembling complete agreement or unanimity of thought is realized by 
research psychologists. 


Devising New Scales 


Before one can seek confirmation, one must have devised a provisional 
method of measurement. There must be a task or a set of tasks. Standard 
procedures, which almost always include a set of instructions for the subject; 
the investigator, or both, must be adopted. Materials or instrumentation may 
have to be devised. Arrangements have to be made for recording observa- 
tions. Out of this integrated System of tasks, procedures, and materials will 
come a record of the subject's behavior to which numbers either already are 
or soon will be assigned. The quantified record represents a provisional set 
of measures, whose quality is unknown. An attempt at confirmation of the 
designations is usually the next step. But we have discussed confirmation; let 
as turn our attention now to the origins of the scale. 


From what source do the ideas, plans, and specifications for the provisional 
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system come? It will be worthwhile to intensify our awareness and sharpen 
our perception of the source. Perhaps we should say * sources," for there are 
at least three principal ones. First, the psychologist imitates the work of 
others and often the imitation is built upon other imitations and is far 
removed in time from the work of the originator. Second, he acts upon 
* hunches and intuitions, which arise not only out of his professional and 
research experience but also out of his everyday experience of living. Third, 
he develops theories and from a particular theory he may deduce hypotheses 
about the quantification of behavior, which lend themselves to translation 
into provisional methods of measurement. Whatever the source—imitation, 
intuition, or theory—he obtains from it no more than a provisional scale. 
The attempt at confirmation must follow. 

Most of the scales employed currently in psychological research are 
basically imitations of other scales. As we shall see later, it is appropriate 
that this should be so. Although the multitude of scales now being employed 
could be classified according to a variety of schemes, for purposes of our 
brief discussion, which is intended to provide the student with an overview 
of current practices, a simple classification scheme will be employed. The 
classification scheme involves five categories, each category being identified 
by a manifest and salient feature of the system which results in the assigning 


of numbers. 

The first category contains physical measures of response. The distance an 
athlete can put the shot, the height he can jump, his time in running à ecce 
of a mile, and many other assignments of nce e — 5 
as provisional measures of response The length of a aie s pm ira 3 
a child, the time which elapses between the PT iy of water drunk by 
Pinon and the response of the een a Seri diac, situation, 
à dog, the pressure a subject exerts on à peris in the palm of a subject's 
tang es tage d conductivity of the as of behavior. Many scite 

€ listed in vri ian aa of them, of course: are of less curren 


to H 
pe etnlogists than are others. 3 measurement which require the 
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inventory, the number of parts assembled by a factory worker a a E e 
number of accidents in which a truck driver is involved, the number hrs 
a housewife expresses a preference for one brand of soap in v ied. E 
with nine other brands, the number of activities a hrigheschoal vers " : 
as his interests, the number of self-references in a client's protocol gm : 
therapy session, the number of occasions on which a student e i 
particular reference book in the library, and the number of nonsense sy x 
recalled from a list which had been learned previously to some a e- 
these and countless others make up the class of psychological measures whic 
require that responses be identified and counted. . 

Tn the third tion are the measurements of classical psychophysics. bes 
psychologist records a physical measure of a stimulus under standard T 
ditions for counting the responses of a subject who must respond to r 
stimulus with a discrimination of some sort.? The discriminatory response 
the subject may be detecting the presence of a stimulus or detecting fi 
difference between two stimuli. It has been known for a long time tha 
a stimulus or a difference between stimuli of any given magnitude within i 
broad range of values on a physical dimension, will sometimes be ee 
and sometimes not. Thus in a large number of judgments of a given stimu a 
magnitude or a given difference, the proportion of correct judgments can e 
determined. Since any stimulus within a considerable range will sometimes 
be detected and sometimes not, there is no value which obviously qualifies 
as the threshold, that is, as the lowest detectable value or the smallest xen 
able difference. The general practice is to define a threshold as a physica 
measure of the stimulus which is correctly detected in some specified propor- 
tion of trials. The procedure in determining a threshold value is to specify 
the proportion of correct judgments which will be taken as defining the 
threshold. The Psychologist must then determine by systematic testing pro- 
cedures, of which there are more than can be described adequately here, the 
value of the stimulus associated with the Specified proportion. If the task is 
to detect the presence of a stimulus, the value so determined is called an 
absolute threshold. If the task is to detect a difference between stimuli, the 
value is called a differential threshold. 

The intensity of a tone of specified frequency, the difference in frequency 
of two tones of specified intensity, the separation in millimeters between TWO 
black lines in a test of visual acuity, the voltage employed in the projection 
of taboo words on a Screen, the concentration of sugar in a solution which is 
tasted repeatedly by a subject, the concentration of monosodium glutamate 
in a solution for taste testing, the ratio of the major axis of an ellipse to the 
minor axis in a visual test, the difference in temperature between two wate! 


baths in which the subject immerses his hands, and the angle of tilt of a chair 


"The specific psychophysical methods implied by this Statement are the “ Constans 
methods." The method of adjustment and the method of limits require physical measur 
ut the data are handled differently. 
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In which the subject i 

ofa subject is seated—these i "e n 
urth c. t 
more persons iue eo = measures derived from the judgments of one or 
or events judged. I ased on physically measured attributes of the objects 
9r event is eb . In the method of absolute judgment, the stimulus object 
The rating ma m to the judge who responds by giving à numerical rating. 
device or it "in D by the judge with the help of some simple graphic 
or more dois] e recorded directly as a number. The ratings given by two 
method of com Wt Tesponse to the same stimulus are often averaged. In the 
to a judge who doi judgment, two or more stimulus objects are presented 
On some assum "pee a preference for one of the objects, orders the objects 
The responses i dimension, or judges the relation(s) between the objects. 
as placing the Of S BTONP of judges are then tabulated and given expression 
Judging opera objects in positions on one or more quantitative scales. The 
given judgin tions vary from one research problem to another, and, for any 
the obterved Í operation, there may be different procedures for converting 
tively ich a ee into scale values. Some conversion systems are rela- 
Proportion anes is the case when the objects are ordered according to the 
Somewhat co preferences expressed for each. Other conversion systems are 
Comparative pee An example of a complex system is Thurstone’s law of 
Listed ment. 

events: Pres are examples of ratings which 
research o efficiency, ability to do independent study, 
Ssignment, other creative acivity, social acceptance, 
Ment, agil resourcefulness, emotional control, speed and a 
Organizinn, ity, i daring, stamina, initiative, | awareness, 
£ ability, conformity, reasonableness, leadership, integ"! 
friendliness, courage, effort, 
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potential capacity for 
interest in military 
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eee emotional stability, and dominance. M l 
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Prefere rent pairings possible for four objects, h subject indicates SIX 
Nees. The preferences O jects are used to derive 
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the four scents on a quan 


views each one 


e scale. 
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2. In a study of verbal associations, a psychologist develops a list of ten 
words and forms all possible groups of three, numbering 120 in all. His 
subjects are instructed to examine each triad and to choose the two words 
closest together in meaning. Data for 100 subjects are combined, the relative 
frequency with which each word is associated with every other word is com- 
puted, and the pattern of relations among the words is studied. The result 


is that the ten words are assigned positions on each one of s 


everal hypothetical 
Scales. 


3. In developing a criterion of performance for enlisted men, a personnel 
psychologist had supervising officers rank their men on technical competence. 
Each officer was presented with a list of twenty enlisted men whom he had 
supervised and was instructed to order them in terms of competence, using 
for ranks the first twenty integers and giving the most competent man a rank 
of one and the least competent man a rank of twenty. 


In the attempts to quantify absolute and comparative judgments, where no 


relevant physical measure of the stimulus is available, a great deal of research 


The fifth and last categor 
psychological measures is not 


points of the plot will lie in a st 


; i raight line. When 
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a student’s description of himself with respect to social competence and his 
description of the person he would, ideally, like to be. The psychologist 
collects an appropriate set of thirty-two statements, each of which can be 
employed for self description in regard to social competence. The statements 
are typed separately on thirty-two cards. In an interview with each of his 


student clients, he instructs the client to sort the thirty-two cards into six 


categories on a scale running from most descriptive of himself to least 


descriptive. The first and sixth categories are to contain one statement in each; 
the second and fifth categories are to contain five statements; the third and 
fourth categories are to contain ten statements. After the sorting is completed, 
the psychologist records the placement of each statement and then has the 
client repeat the task, sorting on this second occasion into six categories 
running from most descriptive to least descriptive of the person he would, 
ideally, like to be. Again the thirty-two placements are recorded. Each state- 
ment now has two placements. If every statement is given the same placement 
on both occasions, there is no discrepancy between the subject’s two descrip- 
tions and congruence is maximal. On the other hand, much inconsistency in 
the placements indicates that the two descriptions are quite discrepant and 
that congruence is low. Again, as in the two previous examples, the place- 
ments from the first sorting can be plotted against the placements from the 
second. If the placements in the two sortings are identical, then all thirty-two 
points will be located on a straight line, the discrepancy will be zero and the 
congruence will be perfect. A considerable degree of scatter in the points 
would indicate a considerable degree of discrepancy in the two descriptions 
and very low congruence. Congruence, Or the lack of discrepancy, can = 
measured by computing à coefficient of correlation for the two sets o 


placements. 
t of psychological measurement. The 


We have now completed our accoun : reri oasi 
purpose of this account was to provide the student with a brief review of pas 
and present measurement practices. It is hoped that the a will m p 

" kground from which the 
a gen e nature and scope of the bac : u 
miele h he can employ in his 


i in i hic 
Psychologist can obtain ideas for measures w 
research. Confronted with the need for a scale, he can, and perhaps should, 


scan the whole field of psychological measurement. Is oe a possibility that 
some physical measure of response would be appropriate ? Ed "rsen 
the task of the subject so that a threshold measure can be obtaine des 
one specify a class of responses which can be identified and are ? n 
judgments be employed to any advantage? Do sorting tasks offer any goo 
Possibilities ? 

Suggesting that a psychol 
measurement scene when he is un 


merely to emphasize that he shou 
that he should take advantage of what has a 


ychologist should deliberately survey the entire 
dertaking research on a given problem is 
ld consider the full range of possibilities, 
lready been done, and that he 
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should try to relate his work to the work of others. If he does so, he ie 
sometimes simply adopt the specific measuring devices which someone else 
has employed. Frequently he will find it necessary to modify or adapt tet 
techniques or instruments of others and may, in this way, make an original 
contribution to the field of measurement. Occasionally his survey may yield 
him nothing suitable for his purposes and he may have to engage in extensive 
research on the measurement problem itself. 

What we have called imitation is the largest source of ideas for measurement 
in research, but it is not the only one. There is always the possibility that 
some essentially new way of measuring behavior will be invented or dis- 
covered, will be investigated and confirmed, and will then take its place in 
the body of available measurement practices. We have already mentioned the 
two sources of new ideas. They are intuition and theory. 

Naming intuition as a source of ideas for new 
nizing the fact that the source of ori 
can seldom, if ever, 
of certainty. A man 
learned colleagues, 
suddenly begins to 
entirely original? 


measures is simply recog- 
ginal ideas, whatever they may concern, 
be determined in an objective fashion or with any degree 
reads widely on some topic or problem, discusses it with 
thinks about it intensely over a long period of time, and 
talk about it as no one else has. Is his idea really new and 
That is doubtful, but it may be so regarded. Thus it is that 
the invention of calculus is credited to Newton—and to Leibniz. In psycho- 
logy, it may very well be that everyday experience, as well as study and 
Tesearch, plays a significant part in the conception of new ideas about 
measurement. 
Finally, 


l the important place of theory in psychological measurement must 
be given proper recognition and em 


been formulated in at least fou 


of particular theories in these or other areas 
are not germane to the purpos 
that purpose is a consideration, in general terms, of the nature of theory and 
its relation to confirmation, 
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In its lowest form, it might be no more than a loose sequence of plausible 
statements, in which case it is lacking in rigor and is usually the cause of 
disagreement, argument, and controversy. Translating the terms of hypotheses 
and theory into the concrete operations of measurement is always accom- 
panied by some degree of awkwardness and uncertainty. It is seldom, if ever, 
obvious as to just how this translation can be properly effected. The gap 
between the terms of the theory and the corresponding operations of 
measurement is disturbingly large. 

Theory is, of course, very important as a vehicle for original thinking about 
measurement and as a source of inventions, discoveries, new approaches, 
modifications, and elaborations, but one point, concerning what theory 
cannot in itself accomplish, should be clear: Theorizing does not itself consti- 
tute confirmation. Theorizing before or during the process of scale construc- 
tion may make confirmation more likely when confirmation is attempted, but 
there is no law that says it will do so in every endeavor. Everything depends 
on the quality of the theorizing and, ultimately, its relevance to the behavior 
in question. There are no guarantees of either quality or relevance. 


Summary 


Two approaches to confirmation were described. The structural approach 
requires the demonstration of correspondence, point by point, between pro- 
perties of numbers and properties of behavior. The functional approach 
requires the demonstration of distribution, reproducibility, and predictive- 
ness. Measurement theory has its place in the devising of new pare e 
theorizing does not itself constitute confirmation. Confirmation is empirical, 


it requires demonstration and observation; it cannot be deduced. 
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BY MEANS OF MOMENTS 
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Psychological data may, on occasion, take the simple form of a single set 
of measures which exhibit variability. The set may consist of repeated 
observations of the behavior of one subject, or it may consist of measures on 
a group of subjects, each of whom has been measured on one occasion. 

The psychologist may wish to describe the set of observations; that is, he 
may wish to communicate to his colleagues, his employer, the public, or his 
students, information concerning the nature of his observations. He can, of 
course, provide a very complete and exact description by simply listing all of 
the measures. While this kind of description is complete and, in a sense, exact, 
it suffers from certain definite limitations. Although it might be quite satis- 
factory to list the measures if there were only a few, it would not be at all 
satisfactory to do so if there were many. If the number of observations was 
large, it would be difficult or impossible for the investigator’s audience OF 
reader to form an adequate impression concerning any of their character- 
istics and the salient features of the set of observations might not be apparent 
at all. Regularities in the data might be completely obscured by the mass of 
data and their irregularities. 

To provide the student with some experience of the difficulties inherent in 
trying to grasp the main features of a fairly large number of observations, 
we have listed in Table 6.1 scores on a test. How many scores are there? 
What are the highest and lowest values? Can you determine from an inspec- 
tion any other characteristics of the data? 

Instead of listing all of the Observations, the psychologist might classify 
them in a frequency distribution and represent the distribution graphically 
by means of a histogram. The frequency distribution and the histogram 
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together provide an exact representation of the data in a form, some features 
of which are quickly and easily comprehended by the viewer. The person 


Table 6.1 
A LISTING OF TEST SCORES 
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assificatory significance in the table; they represent 


Note: Columns and rows have no cl 
data. 


simply an arbitrary arrangement of the 
its histogram can readily determine the 
points of heavy and light concentrations of 
on of the shape of the distribution: its 


Symmetry or lack of symmetry, and its flatness or peakedness. Table 6.2 
Contains a frequency distribution for the test scores in Table 6.1. Figure 6.1 
is a histogram for that frequency distribution. 

In theory, there is another way of providing à complete and exact 
description of a set of measures, but, in practice, it is almost always either 
very difficult or impossible. A complete description of the data might be 
given in terms of moments. The difficulty is that a large number of moments 
might be required. When the set of measures covers, in its range, r distinct 
Score intervals, then r moments are required for a complete description. The 
r moments may include the zeroth and all others up to the (r — 1)!^ moment. 


examining the distribution and 
highest and lowest values and the 
values; he can also form an impress! 
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Table 6.2 


A FREQUENCY DISTRIBUTION OF 
TEST SCORES FROM TABLE 6.1 


Xx f 
y 1 
8 10 
9 42 
10 90 
11 108 
12 78 
13 38 
14 14 
15 3 


N = 384 = total number of scores. 


120 
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: . ` 6.2. 
Fig. 6.1 A histogram representing the frequency distribution of test scores in Table 
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moments are specified can be proved, but the proof Pos 
If the distribution has six score intervals, these same fi t 
pletely describe it. That is to say, there are many dieron 
utions having the same values for the four descriptiv" 
moments and N, The example of skewness when S — 0, which was given ! 
Figure 4.5, is explained by this fact. " Gil 
The frequency distribution of Table 6.2 and the histogram of Figure be 
have nine score-intervals. The value of N and eight moments would m 
required for complete description of the test scores on which the Bis 
and the frequency distribution were based. It would be quite impractical 
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compute the highe 
and ei gher moments, which i i 
| saath puse ar the variable. would involve the fifth, sixth, seventh, 
is reasonable Es Pose Meg sn ee not desirable or not practical, it 
marizatio possibilities of i ipti i 
an an summarizing of data can be bs messis 
Gallestion of cag and directs attention to, the salient one 
nomical. It may m Summarizing is computationally feasible and ae 
to provide a limit " y the requirements and the obligations of the investigator 
may be no more 2 summary, in which case the information communicated 
fained in the dat bin a crude approximation to the total information con- 
communicated nm or an extensive summary, in which case the information 
Partial descri ay be, for all practical purposes, à Very good approximation 
Take a choice re or summarizing requires that the interpreter of dta 
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ended by ees properties of a particular moment and the interpretation 
Particular s investigator. Developing an understanding of the relevance 
oments to particular practical questions or notions is an 


extre 

mely di 

s nely di ea F h 
tanding fficult task for the student. It is important to cultivate this under- 
even the imitation of recognized 


authorities r. PO. to rely on imitation. 

Often lead itation without understanding is often based on superficialities 
tis ouro 3 ta incorrect use of statistics. - 

a Paiticnitas ro that an adequate working conceptualizatio^ with respect 

esis of "i momen requires à thorough familiarity with, and a kind of 

; the logical properties of that moment, in ready association with 

hich vary but have certain 


Clas 
S of 
e racti Pam 
atures in EE situations and problems, W 
mmon. 
general terms, the various kinds 
ist and 


"© pro : 
Pose discussing below, in somewhat 
ered by the psycholog 


Situatio 
appropri and problems commonly encount 
fore | riate choices of one or more moments for purposes of description. 
we must emphasize one point: At 
dom variation as they 


bepinni 
Present iu e the discussion, however, 
ill not give consideration to error or ran 


118 $ Describing Data by Means of Moments 


are defined statistically. We will be discussing values which are propery 
classified as d-statistics and not i-statistics. In Chapter 12, oom 
be given to i-statistics and logical properties having to do with error will be 
the focus of attention. . . 

If a very limited description of a set of measures is to be given in the pe 
of a single d-statistic, the choice will probably, but not necessarily, be the 
mean. The mean conveys information about the magnitude of — 
but, of course, only in a general sense. The recipient of the information shoul 
know that the measures vary around the mean and that they lie in a region 
of the scale roughly indicated or approximated by the value of the mean. He 
should also realize that knowing only the mean leaves him without any other 
information. An assumption regarding other characteristics of the data would 
be risky. For example, he knows nothing of the magnitude of differences 
among the measures. 

The mean of the test scores in Table 6.1 and the distribution in Table 6.2 
is 11, Reflect, for a moment, on the fact that the mean is 11, and consider 
how much, or how little, one knows about the scores. Examine again the 
distribution in Table 6.2 and try to specify some of the properties of the 
distribution not revealed by the value of the mean. 

Somewhat more completeness of description is achieved when two 
moments, the mean and the variance (or the standard deviation) are com- 
puted. The mean gives the general level of performance; the variance indicates 
the general magnitude of differences among the measures. It is a common- 
place now to present the mean and variance of a distribution of observations 
and, in the use of moments, this practice occurs with far greater frequency 
than any other. 

The variance of the distribution of scores in Table 6.2 is 2. Then S, = 42. 
We have already indicated that the mean is 11. Recall that the variance is 
the average of the squares of deviations from the mean. It follows that, for 
the squared deviations, the variance is an interjacent value. That is, there are 
Some squared deviations greater than 2 and some less than 2, if they vary at 
all. We also know that some of the original deviations were positive and 
some, negative. Knowing the mean and the variance gives one some crude 
notion of the general magnitude and the dispersion of the scores, but 
certainly does not give a clear or exact picture. 

Computation of the four descriptive moments is probably the most 
complete description which may, at the present time, be said to be practical. 
It should be understood, however, that the distribution is not fixed in all 
respects by determination of the four values: M,, Va, S, and K. Although 
a number of moments equal to the number of score intervals would be 
required for a complete description, it is our opinion that distributions of 
psychological measures are quite adequately described by the first four 
moments and that variation in features not fixed by these values can, in 


119 


Pragmatics 


practice, be disregarded. In other words, we are saying that the first four 
moments will take account of the important regularities or salient features 
in psychological data and that the irregularities or details not taken account 
of can be ignored. This opinion should not be mistaken for a statement of fact. 
In the future, experience may indicate the desirability and even the necessity 
for going on to additional, higher moments. 

The index of skewness for the distribution in Table 6.2 is +0.177. The 
index of kurtosis is 2.875. Thus the first four descriptive moments can be 


summarized as follows: 


M, = 11, 
Vy = 2, 

S = 0177, 
K = 2875. 


The distribution is positively skew, but only moderately so. In the range of 
values for K commonly encountered in practice, the obtained value is not 
extreme; it lies well within the range of moderate values of K. 

Description of the distribution in Table 6.2 in terms of four moments is 


x 


distribution completely described by the four 


Fig. 6.2 A five-interval histogram for a 
descriptive moments: My = 11, Vx = 2, S = 0.177, and K = 2.875. N = 384. 
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original distribution inexactly, but they provide an exact description of the 
distribution represented by the histogram in Figure 6.2. One might, therefore, 
conceptualize this description of Figure 6.1 in terms of Figure 6.2. 


Measures on a Single Subject 


In the preceding section, a distributi 
described through the use of mo 
involving the description of 48 m 

A psychologist measured the r 


on of test scores for 384 subjects was 
ments. We shall consider next an example 
easures on a single subject. 


eaction time of a single subject on presenta- 
48 trials. Measures were recorded in twenty- 
sures are listed in Table 6.3. The frequency 
4. The histogram is shown in Figure 6.3. 


Table 6.3 
MEASURES OF REACTION TIME FOR A SINGLE SUBJECT 


an 


Un Ca RR CA M Cn 4 CA 
UNDNADDUA 
PBRUDUNURUD 
PRIUUARUD 


Each observ; 


\ ation is expressed 
classificatory s 


atio in twenty-fifths of a second. Columns and rows have no 
ignificance; the ar 


rangement is arbitrary, 
Table 6.4 


A FREQUENCY DISTRIBUTION OF MEASURES 
OF REACTION TIME 


X 
3 i 
4 16 
5 18 
6 8 
7 5 
Original data Biven in Table 63. y 
second. 


7 Measure of reaction time in twenty-fifths of a 
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20 


, Fig. 6.3. A histogram representing the frequency distribution of reaction time measures 
in Table 6.4. M, = 5, V, 1, S = 0.50, and K 2.5. X — reaction time measured in 


twenty-fifths of a second. 


Mz = 5, 
Vz = 1, 
S =0.50, 
K =2.50. 


on is positively skew, since one would 


It is not surprising that the distributi 
reaction time, a limit imposed possibly 


expect the existence of a lower limit on 
by physiological and neurological factors. 


Some Issues in Description 


In characterizing a single subject or a group of subjects, by presenting the 
mean, or in comparing one subject with another or one group with another 
by presenting the difference between two means, interest is often directed to 
one end of the scale or the other. The investigator may wish to point out that 
the individual or group is high or low on the scale, as the case may be. When 
he is making a comparison, he may wish to point out that one individual or 
one group is higher or lower than another individual or group. There are 
Occasions, however, when interest is directed to the relation between the 
mean and some important reference point. For example, the measures might 
be positive and negative in value, representing deviations above and below 
some true value or known standard. 

The choice of the mean as the only descriptive value to be presented is 
Sometimes justified; sometimes, it is not. When the variation around the 
mean is judged to be the result of measurement errors or momentary and 
transitory fluctuations in response, as might well be the case in certain kinds 
of repeated measurements on a single subject, then computing a mean may 
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be looked upon as eliminating error, that is, as removing variability which 
only obscures the true nature of the performance and which properly should 
be removed. An example would be establishing a threshold for visual acuity 
by repeated determinations under the same conditions. Variation in the values 
obtained might be considered error and might, therefore, be eliminated by 
computing a mean. 
When individuals or Broups are known to have approximately equal 
variances and higher moments, the presentation of means may be all that is 
required for a comparison. Since, under these conditions, the individuals or 


ike in all respects save the one on which 


l. Tw 
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of observations on which they are based varies. Sums can only be compared 
when the numbers of observations are equal, since a sum is directly a function 
of the number of values as well as their magnitude. 

When variances may be unequal but means and the higher moments are 
equal, it is defensible to limit the comparison of individuals or groups to a 
comparison of variances. A larger variance indicates greater differences 
among the responses of an individual or the responses of a group than the 
differences for the other individual or group. It is interesting to note that, 
instead of variances, sums of squares of deviations might be used for com- 
Parison when N’s are equal. As was the case with means in relation to sums, 
variances can be compared when N’s are not equal whereas sums of squares 
of deviations cannot be compared. 

In comparing individuals or groups with respect to both means and 
variances, a variety of interpretations is possible depending on the particular 
combination of values and the practical situation. When means are equal and 
variances are equal, or judged to be approximately so, the comparison offers 
No difficulties of interpretation. The conclusion that the individuals or the 
groups are alike is defensible unless there is some reason to believe that they 
differ with respect to higher moments. As has already been indicated, when 
means are unequal and variances as well as the higher moments are equal, 
it is reasonable to base one’s interpretation on a comparison of means; when 
variances are unequal but means as well as the higher moments are equal, 
variances may be compared and interpreted. Values far means and variances 
may, on occasion, suggest that there is little or no overlap in the two sets of 
measures and that there is, consequently, a fairly substantial basis for the 
interpretation that one set of performances is superior to the other. The 
question of overlap can, of course, be answered much more conclusively 
by examination of the two frequency distributions or their histograms, and 
their overlap. 

With certain combinations of unequal means and unequal variances, an 
interpretation which goes beyond the statement of the several values may be 
unwarranted, Students should be aware that only those interpretations whose 
appropriateness is obvious, or very nearly so, or which can be reduced to 
terms in which they become obvious and command assent, are worthwhile. 
When an interpretation cannot be stated so that its correctness is obvious, 
persisting in that interpretation only provokes unfruitful debate and 


argument. 


Test Construction 


ments are employed extensively in the construction and 


The descriptive mo: 
t scores can be given a 


revision of psychological tests. A distribution of tes 
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partial description by computation of its mean, variance, index of skewness, 
and index of kurtosis. It is possible to express each of these moments in terms 
of certain characteristics of the items in the test so that it becomes theoretically 
possible to select items to produce a test whose score distribution will have the 
desired moments. What we are suggesting is that the distribution of test 
scores can be modified and determined through the selection of items for the 


test. On occasion we may wish to produce any one of a large number of 
distributions of varied form. 


We may wish to produce a binomial distribution for which the number of 
score intervals is (n + 1) and the frequencies are given by the successive terms 
in the expansion of (1 + 1)", where 2" = N. A distribution of this type is 
symmetrical and unimodal, with smaller frequencies in the tails than in the 
center. As n becomes larger, the value of the index of kurtosis for the binomial 
distribution approaches three and the function approaches the normal curve, 
an Important continuous function which will be discussed in some detail, 
later. Three examples of binomial distributions are given in Table 6.5. 


Table 6.5 
EXAMPLES OF BINOMIAL DISTRIBUTIONS 
X f x y 
n=2; n= 10; 

^ = "s iie = EA 
0 1 0 1 
1 2 1 10 
2 1 2 45 
( n «y 3 120 
= 4; 4 210 
a = 16 5 252 
1 6 210 
H 4 7 120 
7 : 8 45 
d 9 10 
10 1 


à m en sometime wish to produce a test which 
normly, or approximately so, over the score 


to the failing of a single test į Practice to assign a score of zero 


item. Consequently, Scores of a group of 
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Table 6.6 


DISTRIBUTIONS PROVIDING TWO 
KINDS OF DIFFERENTIATION 


X f x 

(N — 50) (N — 48) 
0 10 0 3 
1 10 1 8 
2 10 2 30 
3 10 3 0 
4 10 4 7 


vary, taking on values of zero and one. Two moments only, the zeroth and 
a distribution of item scores. 


the first, are required to describe completely 
An example of a distribution of item scores is given below. 


I f 
0 60 
1 40 


In the example, N = 100 and the mean, M; = XfIJN = (60 x 0 + 40 
x 1)/100 = 0.40. Notice that M; turns out to be p, the proportion of subjects 
passing the item. V; the item variance, can be computed from the mean by 
the formula: 
= Mil — M3). 
Of course, S; = JV; 
proportion of subjects passing the item, and 


If we let M; = p = the 4 
(1 — Mj) = q = the proportion failing the item, then 


V; = pd 
and 
S, = \P9- 


Describing Populations 


in the discussion of statistical inference and 
moments are also employed in 
stributions. Population distribu- 
h random samples are obtained. 


As we shall see later on, 
sampling distributions, the four descriptive 
characterizing what are called population di 
tions are the parent distributions from whic! 
We shall defer further comment on this topic. 


The Limitations of Description 


Even when a complete description of the data at hand is possible, research 
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psychologists are seldom satisfied with this kind of analysis and evaluation. 
In describing data, one ignores error but must admit that the original 


measures and any values computed from them were subject to uncontrolled 
influences or biases, which one would like to take account of, in a more 


rigorous fashion. If there was any selection of subjects, if the occasions of 
measurement were chosen in some arbitrary manner, if there were any 
environmental influences not intended by the investigator and not strictly 
controlled by him—and these conditions are almost inevitable in any psycho- 


logical study—then the presence of error and the possibility of bias in the 
data can be taken for granted. 


Because of the presence of biasing errors in his research, a psychologist 
prefers to analyze his data by methods which make it possible to estimate the 
magnitude of errors. It is conceivable that he could, on the basis of his own 
prior experience and the reported experience of others, make a judgment as 
to the size of errors and the possible size of the bias, and could reach con- 
clusions which, even though the steps in the judging process cannot be made 
explicit, and even though the process is private and lacking in standardization, 
would be correct and could be defended in the scientific community. Never- 
theless, there are far better methods and techniques available for estimating 
error and bias, and for making decisions about research data. We are referring 
to the procedures of statistical inference. 

In saying that statistical inference provides the best methods for taking 
account of error and bias, we must add a crucial qualification. Statistical 
inference, wh propriately applied, is superior to 
tion, but when it is incorrectly and 
escriptive evaluation which has been 


Teport may be misled as tot 
report by the presence of thes 
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that certain kinds of research data which are available to psychologists and 
educators are important and should be reported in their journals, but are 
suitable only for description plus an interpretation based on — 
judgment. To sum up, humble data deserve an unpretentious analysis and a 


simple, honest report. 5 
Statistical inference will be discussed at length in Chapters 11, 12, 13, 


and 14. 


PRODUCT MOMENTS 


Syntactics | 7 


are those computed fro 


We let X; be the ith number in one set and Y; be the it” number in a second 
Set, Where i = 1,2,3,. ; N. The pairing of two values, one from each 
Set, is indicated by the common subscript, ji, The (2N) numbers can be 
Classified in the rectangular arrangement shown in Table 7.1. Each column 
in the Table Tepresents a set of numbers; each row is a pair of values. 


Table 7.1 
PAIRED VALUES OF X AND Y 


Columns 

X Y 
1 X, Y, 
2 X, Ys 
3 X; Y; 

Rows è 

t X; Y; 
N XN Yn 


The Product Moment of Z, and Zy EX 
me exponents employed, that is, by the power of X and the power of Y. The 
p peal formula for a product moment of X; and Y,—let us call it the p^ 
nd ath product moment—can be written as 
Zxpy?. 
N 
It will be convenient to dispense with the subscripts and write simply 
Ax" y” 
N 


The Product Moment of First Powers of X and Y 


case of product moments in 


We shall be concerned here with the special 
* one,” the general formula 


Which m = n — 1. When both exponents are ` 
becomes 
ZXY, 
N 
Computation of the product moment of first powers of X and Y involves 
Summing the products of X and Y. and dividing the resulting sum by N. It 
Y. be helpful to refer these operations to the paired values of X and 
in Table 7.1. 

: "i a matter of convenience, we shall 
fe er to the special case involving first powers of two var 

© understood that the term has a more general definition. 


use the term “ product moment "to 
o variables, but it should 


The Product Moment of x and y 


diaga- M,, and Y to y by the 


Let us 
transform X to x by the form A 
mure nt of x and y Is called the 


uct mome 


formula 
y = Y —M,, The prod s 
Covariance., It will be represented by the symbol, Vow THAR: 
Zy 71 
Vey = N ` G-1) 
f Zz and Zy 


The Product Moment 0 


Consider x transformed to Ze by the formula, Zz = x/Sz and y trans- 
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formed to Z, by the formula, Z, = y/S,. The product moment of Z, and Z, 


is an important value, the product-moment coefficient of correlation or Fey- 
The formula is 


_ 22,2, (7.2) 
© N 


Voy 


The product moment of first powers of X and Y appears in certain com 
puting formulas, but has no Particularly important properties. The logical 
properties of the covariance, V... and the coefficient of correlation, r,,, will 
be discussed next. In this initial discussion, no attention will be given to 
Properties having to do with error or random variation. 


Logical Properties of the Covariance 


By definition, 


"np 

N 
The numerator can be written to make explicit the contributions of the four 
possible combinations of positi 


sitive and negative deviations of x and y. If we let 
X. = positive X deviation, 
X- — negative Y deviation, 
Y+ = positive Y deviation, 
Y- = negative Y deviation, 


Fs = frequency of Pairings of positive deviations of X and positive 
deviations of Y 

f- = frequency of pairings of positive x deviations and negative Y 
deviations, 

J= = frequency of pairings of negative y deviations and positive Y 
deviations, and 


I| > (x,y 4. Zx-y4)|. 
2xy and Vey will be negative when 
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(Sx. + Ey] < Kx + Ex» 
^ is, when the absolute value of the sum of positive products is less than 
e absolute value of the sum of negative products. ZXxy and Vy, will be 
zero when the absolute values of the two sums are equal, that is, when 
Ix ys + 2x-y-)| = (Zx+y- + Zx-y+) l- 


á Notice that the absolute values depend not only on the magnitudes of 
eviations, but on the frequency with which the pairings occur as well. 


Bivariate Frequency Distributions 


Given two sets of paired numbers, X; and Y; the pairs can be doubly 
classified according to their values as shown in Table 7.2. The rectangular 
classification is called a bivariate frequency distribution. 


Table 7.2 
A BIVARIATE FREQUENCY DISTRIBUTION 


of a bivariate frequency distribution might be 
achieved by means of a three-dimensional figure having two horizontal axes, 
one for Y and one for Y, and a single vertical axis for frequency. Instead of 
the two-dimensional rectangles employed in a histogram, we would have 
three-dimensional rectangular solids whose height would represent frequency. 

Although three dimensions are implied by bivariate frequency distributions, 
it is usually more convenient to confine the figure to two dimensions. To do 
so, one takes the pair of X and Y values as the coordinates of a point in the 
two-dimensional space and plots all N points. The points may form any one 
of a great variety of configurations, but are often scattered throughout the 


A graphical representation 
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space, a circumstance which accounts for the designation of the figure as a 
scatter plot. . u 

Original values of X and Y can be used for plotting purposes, but it is 
instructive to transform each variable to deviations and to plot pairs of 
deviations, x and y. In Table 7.3 are 10 pairs of values of x and y. These 


deviations have been used as coordinates for the plotting of the 10 points 
in Figure 7.1. 


Table 7.3 
VALUES OF X AND Y 
TRANSFORMED TO DEVIATIONS 


(First Example) 


x Pd 

1 3 4 

2 1 0 

3 3 3 

4 2 1 

Pairs FS E E 
7 —4 —3 

8 —3 —2 

9 =I - 

10 2 0 


The covariance, y. " 


J i m is computed from the deviations in Table 7.3 by 
summing their products and dividing by N = 10. Thus 
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in that quadrant. In the preceding section on logical properties of the co- 
variance, f} f+- f+, and f- were the frequencies for the four possible sign 
combinations arising from the pairing of x and y deviations. These four 
frequencies can be referred to Figure 7.1 and interpreted as the number of 
points plotted in each quadrant. In this particular example, there are no 
combinations of positive x's and negative y's, and no combinations of nega- 
tive x's and positive )’s. Consequently, although the points scatter somewhat, 
all of the points lie within or on the bounds of the first and third quadrants, 
and the trend is unmistakable. 

Another bivariate distribution is given in Table 7.4 and the points for the 
paired deviations are plotted in Figure 7.2. Examination of the distribution 
will reveal that the x and y deviations of Table 7.4 are the same as those of 
Table 7.3. Although the deviations are the same, the pairing is different. 
Consequently, the covariance is different and the plot of Figure 7.2 is different 


from that of Figure 7.1. V. for Table 7.4 is zero. 


Table 7.4 


VALUES OF X AND Y 
TRANSFORMED TO DEVIATIONS 


(Second Example) 


x Jy 
1 3 0 
à 1 3 
3 3 -2 
4 2 4 
5 —3 2 
Pairs 6 0 -—5 
i —4 — 
8 —3 0 
9 =l | 
10 2 =g 
Y 
+5 
—e- +5 x 
-5 | 


hose coordinates are the paired deviations in Table 7.4. 


Fig. 7.2 Plot of points wi 
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A Computing Formula.for the Covariance 


It is, of course, possible to compute Vzy by transforming all the original 
X and Y values to deviations from the mean, obtaining the sum of the pro- 
ducts, and dividing by N. It is usually more convenient, however, to employ 


a formula expressing V, in terms of original values, X and Y. The derivation 
of such a formula is given below. 


By definition, 


Consider the numerator, Xxy, By definition, 
Ixy = X(x — M.Y — M,). 
Multiplication of the two binomials yields 
2xy = XY = M,Y — M,X + MM). 
Simplification produces 
Zxy = ZXY — M,2Y — 


M,ZX + NM,M,. 
Substitution of 2X/N for M, and ZY/N 


for M, gives 
ZXEY  Zxxy ZXY 
oxy = gam NN i C eui 
Xy ZXY N N + N 


Combining terms on the right yields the final computing formula, 


Ixy = Xxy — SENT, (7.3) 
Formula (7.3) is c 


onvenient for co 
deviations, from w 


mputing Yxy, the sum of products of 
hich V, the cova 


riance, can be obtained by dividing by N. 


Prediction Coefficients 


s Üyz 


by. = Vey (7.4) 
d is 
an 
Fui 
Dey = Y (7.5) 


These values are also called regression coefficients, 
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Notice that each prediction coefficient ds the ratio of the covariance to one 
of the variances. A prediction coefficient is a constant value which can be 
employed, in an equation, to predict variability in one set of numbers from 
variability in the other. 

The constant, b,» is derived, as follows: Given paired values of x and y, 
we wish to find a value for the constant, 5,4, such that we can compute a 
value, y’, for each value of x by the formula, 

gm Dag 

The computed value, y", is called a predicted deviation. We wish to make the 
best possible predictions, that is, we intend that the values of y’ will, in some 
sense, be as nearly equal to the actual valucs of y as we can manage to have 
them be. With some sets of paired numbers, the value of the desired constant 
might be determined quickly, by inspection, as in the example in Table 7.5, 
where it is apparent that, when by, = 3, the predicted values, y’ = 3x, are 
identical with the actual values of y. That is, y' = y in each prediction. 


Table 7.5 


PAIRED DEVIATIONS WHICH 
PERMIT PERFECT PREDICTION 


x y y = 3x 
1 =3 -9 —9 
2 —1 —3 —3 
Pairs 3 0 0 0 
4 2 6 6 
5 2 6 6 


Note that by; = 3. 


Usually the value of the constant cannot be chosen so easily and with such 
perfection in the predictions. Examine the paired values of Table 7.3 and also 


those of Table 7.4. It should be apparent that there is no value for by, which 
will permit the computation of predicted values exactly equal to the given 
values of y. No matter what value is assigned to bys there will be errors in 
the predictions. We shall define an error of prediction as the difference 


between y and y’; that is, as 

(y — y) = — bue 
choose a value for byz, which will make the 
“ best " prediction. We have not said, exactly, in what sense prediction can 
be the best. A criterion commonly invoked to define “ best ” is that the sum 
of squares of errors of prediction shall be a minimum. In applying the 


criterion, it is the quantity, 
Sy — YP = 20 — by) 


Keep in mind that we wish to 
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Which is to be minimized. The expression can be written as 


Ay? — 2b,,xy + b2,x2) = Zy? — 2b, Zxy + b2.2x*. 
Reordering the three terms on the right gives 


bi.Xx? — 2b,,Exy + Xy? 


If we multiply the expression by Xx? 


and then indicate division by 2x?, we 
obtain 


Bio Ex*)? — 2b, Xxy Xt +. Ex Dye 
i ] Ax? 


s the original expression. We can also add = 
subtract the same quantity, ( Zxy)?, in the numerator without changing 
value of the expression, Adding and subtracting (2xy)? yields 

by 2x?) — 25, Zxy Zx? + (Zxyy — (Zxyy + 2x? By? 

oO Ax? 

s ` ial 

The first three terms of the numerator constitute the square of a seemed 
and the numerator can be rewritten making the binomial explicit as indica 
in 


Coax? — Zxyy — (2)? + rays 


x? 


choice of a value for Dyes 


Which, since it is a square, must be equ 


al to or greater than zero. The function 
will be a minimum when 
(b, Ex: — Zxyy = 0, 
in which case 
De xl oxy =0 
and 
2xy 6 

Drs $2 (7.6) 

Thus it can be said that 
O — bx) < Ay — ax 

when a x p 


m 
If we divide the numerator and the denominator of the fraction in formula 
(7.6) by N, we obtain 


"The student may wish to review an earlier us 
that Z(y — Mz 


€ of this method, in Chapter 4, to show 
X — ay where a « M. 


z 


Another Criterion for Prediction i 


Formula (7.6) is a convenient computing formula for bys if one first obtains 
xy by formula (7.3), 
ZXZXY 
Zxy = ZXY- -N ` 


and Zx? by formula (4.2), 
n ZO 
De = ANS e N i 
By similar procedures we could show that bzn, the constant for predicting 
x from y, is given by the formula, 


Exy 
by = BP (7.7) 
and that 
V, 
bey = TR 


Another Criterion for Prediction 


" ; iances: 
Consider again the prediction of y from x. We can write two covariance 


One involving the actual deviations of x and y, 
Axy 
aT 
4nd another involving the actual deviations of x and the predicted deviations, 
Zxy 
Ves = UN d 


; o be equal, we 
If we impose the condition that these two covariances are t q 
iterion. We write 


c 
an solve for b,, to satisfy this cri 


yx 


Viy = Vey 
fr TN 
9m which it follows that 
Xy _ ZI 
w N 
and 
Zxy = EX 
Since y — b,x, we can substitute for y' and obtain 


Zxbax = P9 
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The constant, 5,,, can now be written before the summation sign and both 
> Dyzs " à 2 
sides of the equation can be divided by Zx? to yield 


Bay 
| AXg 
We observe that the same value for b,, satisfies both criteria. It makes the 


den ie x z 
sum of squares of errors of prediction a minimum and the two covariances, 
V,, and V,,,, equal. 


bys 


Summary 


The prediction coefficient, 
oxy 
dal 


is the constant in the prediction equation, 


b 


y! em bu 


We may substitute given or actual values of x in the equation and pies 
Predicted values of J’, which have the Property that the sum of squares o 
errors of prediction, 


i Ay — yy, 
Is à minimum. 


The equation for predicting y from x can be written in terms of original 
Values, X and Y. If we lety = y' — M,and x = X — M, then the equation, 


Y = byx, 
becomes, after substitution for y' and x, 
"bs P My sb Ma). 
Y cb by. M, + M, 
and 
Y = byeX + (My — b pM.) (7.8) 
Notice that formula (7 8) has the form o 


gularities in the paired 


rmit anything better than a crude approximation in the 


Xpression of the relation, 
zm the constant in the equation, 


xX = bey, 


As regards b 
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substituting actual values of y permits computation of predicted values of 
x’, which have the property that the sum of squares of errors in predicting x, 


Ax — xy, 
is a minimum. The equation can be written 
X! = byyY + (Mz — bs, M). (7.9) 


Formula (7.9) represents an approximation in expressing the relation between 
X and Y as a linear function. 


Graphical Representation of Prediction Constants 


After b,» has been computed for given paired values, the equation, 
y = by 


can be plotted by assigning two or more values to x, computing corresponding 
values of y’, and treating these pairs of values, x and y», as the coordinates 
of points located in the scatter-plot of x and y. A straight line drawn through 
the points, (x, y’), is called the best fitting line or the line of prediction. The 
coordinates of any point on the line give a value of x and the corresponding 
predicted value of y’. The line always passes through the point, (x — 0, 


y' 20) 
The prediction coefficient, bys» is computed below for the paired values in 
Table 7.3. 
Zxy = 52, 
Zx? = 62, 
in 52 
=> = 0.84. 
bye = @ = 


Fig. 7.3 Plot of the best fitting straight line for the scatter-plot of Fig. 7.1. 
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We shall assign values of —4, 0, and 7-4 to x, and compute the corresponding 
values of y’ by the formula: 


y! = byx. 


The three resulting pairs of values, (x, y^), are (—4, —3.36), (0, 0), and 
(+4, +3.36). In Figure 7.3, these three points have been plotted in a repro- 
duction of the scatter-plot of Figure 7.1, and a straight line has been drawn 
through the three points. Notice that the points of the original scatter-plot 
do not fall on the line. If one draws a vertical line from any point to the line 
of prediction, that distance Tepresents an error of prediction, 


(o —y)-2(y— Dax). 


The constant, b,» = 0.84, is the slope of the line of prediction. It indicates 


the vertical change per unit horizontal change in the line of prediction. 
Notice that the slope has a Positive sign, 
The prediction coefficient, b, for the values in Table 7.4 is 
2xy 0 
be = a= = 0. 
Zx? 62 
Since 5,, = 0, all predicted values, y' 
Prediction will be a horizont 
ing to note that, since b 


» will be equal to zero and the line of 

al line coinciding with the X-axis. It is interest- 
=0 

m , 


d (y — by,x) =y 
an 


Table 7.6 
PAIRED VALUES OF x AND y 


(Third Example) 


x y 

1 =i 2 

2 0 =2 

" 3 4 =3 

P; 

‘airs 4 2 —2 

5 —3 3 

6 =2 2 

Figure 7.4. The Covariance js 
MEL NETS 


id NU Ux 79-545 
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Y 


Fig. 7.4 Scatter-plot and line of prediction for the paired values in Table 7.6. 


The prediction coefficient, bys, is 
Axy | —31 
bis = za = gg = 0l 
The line of prediction has been drawn in Figure 7.4 through the coordinates: 


(—4; +3.64), (0, 0), and (+4, —3.64). 
Notice that the points of the scatter-plot of Figure 7.4 lie predominantly 


in the second and fourth quadrants, that the covariance is negative, and that 
the slope of the line of prediction is negative. 


Logical Properties of the Correlation Coefficient 


By definition, 
ey 
Fey — NC 

the product moment of Zz and Z,. If we substitute x/S, for Z, and y/S, for 
Z,, we obtain 


TAL 
Li Ss Sy 
which can be written as 
Zxy _ ZIN 
lzy = NS,S, — Say 
and finally as 
"CUM Vow (7.10) 
WE Vay 


Formula (7.10) gives the first property of the product-moment coefficient 


Product Moments 
142 


of correlation: The product-moment correlation coefficient is the ratio m 
covariance to the square root of the product of the variances. Since t - 
root of the product of n numbers is the geometric mean of the n numbers, 
AVV, is the geometric mean of the two variances. 

Formula (7.10) can be rewritten, as follows: 


JVV, | JV,V, VNV,NV, 
Since NV, = Xx! and NV, = Xy? 
2o Ay, (7.11) 

4 2x? xy 
Formula (7.11) is a convenient com 
products in the numerator, Zxy, 
sums of squares in the denomin 
formula (4.2). 

Let us consider a second property of r,,. 
paired values, X; and Y, that we wished 
from variability in the other, 
in terms of Z, and Z , obtain 


Vey 


puting formula for r,,. The sum of 
can be obtained by formula (7.3). The two 
ator, Zx? and Zy?, can be computed by 


Imagine that we had two sets of 
to predict variability in one set 
but that we formulated the prediction problem 
ed by transformation from Y, and Y;. If we let 


Z, = an actual Z-score for variable Y: 

Z, = an actual Z-score for variable Y;; 

By. = the constant for predicting Z, from Z,„; and 
Zi = 


a predicted Z-score for variable Ü 
then the prediction equation is 


re : Z, = Pus, 
An error of prediction is 


= (Z, — B,Z y. 
quares criterion in sol 
bys, we find that 

_ 22,2, 


wr =r =F, 


Thus it turns out that the product-moment correlation coefficient is the 
prediction constant in the equation, 


ap ving for the value of f,,,, just as 
we did in solving for 


ay 


Sy Buti, 
and also in the equation for Predicting Z, from Zu 


Ze = BaZ, 
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A third property of r,, is given by the equality, 
Fay = NRE A 

which can be expressed, as follows: The product-moment correlation 
coefficient is the geometric mean of the two prediction constants, by» and 
b... This relation can be proved by substituting Zxy/ Ex? for b ys and Exy/ Zy? 
for b,, in the equality stated above. The substitutions yield 

[Bay xy | SZO 0 Rx» 

Je = J XGA) OO Jx Ey J 2x8 Ey 
rived earlier. 


which is computing formula (7.11) de 
d in the following equality statement, 


A fourth property of rz, is indicate 
Fey = lyy = Paz 
ows: The coefficient, rey, is the correlation 


which is to be interpreted, as foll 
ues of either variable. Consider r,,.. We 


between actual and predicted val 


can write 
ayy’ 


toy = NP 


Substituting b,x for y’ and simplifying yield 
Lyb yeX bysEXy | Zxy " 


^w = TSP Vb eS VEZ 


A similar proof could be given showing that 


Fg = Taye 


has to do with two standard deviations: the 
and the standard deviation of predicted 
the prediction constants, by, and byy 
tions. Predicted deviations exhibit 
ms of a variance and a standard 


The fifth property of rev 
standard deviation of actual values | 
values for either variable. In discussing t 
we defined y’ and x’ as predicted devia 
variability which can be measured in ter a 
deviation. The variance of predicted values of y is 

Sy" 
pe a 
The standard deviation of values of y” is, of course, 
a ae 
The product-moment correlation coefficient is the ratio of the standard 
deviation of predicted values to the standard deviation of actual values of Y. 
That is, 
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which statement can be proved, as follows: 
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Sv _ We HEP i2» 
Sy, JEgpN ^ ja> 


Since y' = b,,x, we can substitute and obtain 


LIOS MiB bu IE 


Fry 


JE 7 qme 7 mp 
Substitution of. Xxy| Ex? for by» gives 
Zxyy Xx oxy 
NE E = is 


ry 


O eJI JEE 
The proof is concluded. A similar proof could be given that 
= 


The sixth Property of ra, concerns the relation between Xy? (the actual 
variability in Y), X^ (the Predicted variability), and ay — y? (the 
variability of errors of prediction). The first step i 


n determining the relation 
involves writing the identity: 
y=y+(-y’. 
Squaring both members of the identity gives 
P y + 3y( — y) 4 (y — yp. 
on the left and on the right, and sim 
ay = Ay" 4 Z2y( 


Examine the Second sum on the right. We can remove the ** 2”, substitute 
bux for y’, and obtain 


Summing N terms, plifying produce 


y —y)- 2X(y — yy. 


+ 22Zb,x(y — byux) 
Which is equal to 
22b,xy — 223b?.x? = 2b 


max — 2b2,Zx?, 
The quantity, Xxy| Dx? 


; can be Substituted for b 


ve» producing 
XX» — ABxyye Dye 
a? c 


DU LXX A Dey? m 
Ri (2x) = Ax? 0777 we Me 
Thus the second term in the right member of the original identity can be 
eliminated, yielding the equality: 


2y = Dy + Sy — ye 
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Dividing by N, we obtain 
; V; = Vct Vy-y- 
hus it can be said that the variability in Y can be analyzed into two additive 


m das the variability which can be predicted from X and the variability 
hich cannot be predicted from X. A similar statement could be made 


regarding the prediction of X from Y. 


In the discussion of the fifth property of rz» it was shown that 


Sy JV 
fn = vg — JV, 
S, «M 
s y N*v 
quaring reveals that 
Vy 
tay = y, 


I 
f we take the equality, 
y, = Vy + Vs- 


maid ' 
d divide both members by Vy, we obtain 


Y 
an , i f 
curd statement which can be interpreted as saying that e re a 
variability in Y consists of two proportions: the proportion tha! 


Spee and the proportion that cannot be predicted. essere since 
are of the product-moment correlation 


ty = y JV. 
coem. Vu we can say that the squ t can be 
ee is the proportion of variability in one set of rine uantity 
n icted by a linear equation from variability in another set. (The q A 

^ IS Sometimes called the coefficient of determination.) 


Ince p2 H 
"ry = V,/V,, we can write 


and 

1 — thy = "Vy d. (Th 
silt dicted. (Ihe 

© obser ... of variability not Pre 

Y ve that (1 — r2,) is the proportion ^ „mination. 

pu ( ~ re ) A al es d us coefficient of wc pere P 

he Ty) à fi its of rev t was S 

that Seventh property has to do with the lim i 


fro, 
Dot; 
Which it follows that 
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1 

T P iux : 
Since variances are positive (or zero), V,-.,', and Vy will be positive (o 
zero). Then (1 — r2,) must be positive (or zero). That is, 


(=r zx 0. 
Then " 
Iz 
and 
lE nl 


It follows that 
—] Sr, = HL 


The product-moment coefficie 
having a value of =L F 


The eighth property o 


nt of correlation is a dimensionless number 
1, or some value between these two limits. 


f ray is that the coefficient can be expressed as a 
function of the pair differences, Y — y. Given two sets of numbers with a 


certain amount of variability in each set, then the smaller the variability of 
the pair differences, the greater ray will be. The greater the variability of the 


pair differences, the smaller "zy Will be, algebraically, The proof follows. 
Consider the identities: 


Ax — y} = Txt — 2xy + y?) 
and 
A(x — y} = Dye — 2Zxy + Xy? 
Transposing terms in the second identity gives 


22xy = Ix? Xy — By — I. 
Dividing by two yields 


Zxy = 28 + Xp a Seat n 


Examine the quantity, X(x — yp 


; in the numerator, It is the sum of squares 
of differences between deviations 


. It can be shown to be equal to the sum of 
ces, Y — Y, from the mean difference, 
A(X — yy. 
By definition, x = y — M,andy = y — M ,. Then 
"ie 0 = XU Me) eps M ZIX — Y) — (M, — MP. 
Let (X — yj = D, a difference. The 


n the mean difference, Ma, can be 
expressed, as follows: 
ZD _ZX-Y) y sy EX XY 
M, er ee Se =e) ees, S cs eon = è 
a=W N N CUN ` y= Me — My 
After substitution of D for (X — y) and Ma for (M, — M ,), we observe 
that 


Zæ -y = XD — Mi} = Za? 
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where d = D — Mi. Substituting Xd? for X(x — y} in 
dx? + Ly? — Bx — y 
Be == y : (x — y) 


yields 
Ax + Ly? — Ld? 
pe LAM i 


We divide by N on both sides to obtain 
Vs: Kam Vy 
2 


Ve y 


Dividing on both sides by „V.V, yields 
Vey V, + V, — Va 

JV, 0 ME, 
If we consider V, and V, fixed and Va allowed to vary, then it is evident 
that r,,, becomes smaller in algebraic value as V, becomes larger, and that 


ry, becomes larger in algebraic value as V, becomes smaller. 


= Tey 


Coding and Its Effect on 
Product-Moment Correlation 


Consider first adding (or subtracting) a constant. Let 
X= X fia 


We wish to examine the relation between ray and rxy. We can write 


Zxy 
r. SSS 
di A Ex" Sy? 


When X' = X + a, My = Mz +4, and 
ynY-M.eQXR)-0ntg)-X—-M.-x 


Therefore, 5 
Xy 


Tus = Fay 


and we conclude that adding (or subtracting) a constant does not affect the 


product-moment correlation. as 
Consider next coding by multiplying (or dividing) by a constant. Let 
xy’ = aX. 
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Again we wish to examine the relation between r 


zy and r,,. We begin by 
writing 


When X' = aX, M, = aM, and 


x= X'— My, =aX — aM, = a(X — M,) = ax. 
Then 


Zaxy aXxy 2xy 
yey = — T 


wr —— = 


=> —————— ——— ‘ian = yy. 
y Zex? Dy? af D JED = 


We conclude that multiplying (or dividing) by a constant does not affect 
the product-moment correlation. 


Special Formulas for Product-Moment Correlation 


A special computing formula can be derived for the correlation between 
two variables, Y and Y, 


» When one of them is dichotomous. Consider the 
formula, 
2xy 
fig e an 
J 2x2 Dy? 
where 


Zxy = XXY _ ZXZY 


Zx = Dye _ (EES 
and N 


ula is based on the fact that a dichotomous 
variable can always be coded to values of 0 and I. Let X’ be the coded value 
1 


Xs. The coding required involves subtraction 
X), as follows: 


m m 
Xf uoo da 


of X 


(Xs — X, 
X, =- 7% 
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Aft " ; 
cm the two values of Y are coded, the bivariate distribution can still be 
sented by Table 7.7 except that X, should be changed to X, = 0 and X, 


should be changed to X7 = 1. 
Table 7.7 


A BIVARIATE FREQUENCY DISTRIBUTION 
WHEN ONE VARIABLE IS DICHOTOMOUS 


A f,  Sums 


Note: fa + f. = N- 

e coding operations performed 
ffect on the correlation between the 
the changes which can be effected 
lation between X' and Y, 


Recall that it was established earlier that th 


tw à A 
Hs i variables. That is, rwy = "ew 
he formula for the product-moment corre 


meu 
j y Ex ) 
let us examine the quantities: ZX; sx", and ZX'Y, in 
Ex'AiY 
Xxy = xx" i 
and 
Ax = mE = N 
Since X! = 0and Y; =1 the sum of YX’ will be simply the frequency of Nu 
Which is the sum of the frequencies in the second column of Table 7.7. In 
Symbols, 
N à ë r r Er 
Py càEfaxa fX = S f) + £D = Z fis 
1 j EU j j j j 
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Let us call the sum of the frequencies in the second column, f.s. Then 
N r 
Z X' = Efa = fa 
1 j 


The sum of squares of coded values, XX”, will also be equal to the 


frequency of Xj, which is the sum of the frequencies in the second column 
of Table 7.7. Thus 


1 


Ex" = BX + Bax? — Leo + Sy,0y = Z fa fa: 
j j F j 


ZX"Y, the sum of products of Y' and Y, is equal to the sum of the iced 
of Y which are paired with Yj. This quantity—let us call it Q,—is the su 


of products of the frequencies in the second column and the Y values along 
the left margin of Table 7.7. That is, 


N r y. 
Q, = ZX'Y = Ef XY, + E f XL Y, 
T g 


but X = 0 and X, = 1, therefore 
N r 
Q,— Zx'Y = È fa Yy, 
J 


We can now substitute f., for ZX', f for SX", and Q, for ZX'Y in is 
formulas for Zx'y and Dx", After these substitutions, the formulas becom 


N 

and 
2 
Ex" = fs NUS 


Then the computing formula for fey When X' 


is a dichotomous variable 
which has been coded to have values of 


O and 1, can be written 
ro, Qs - fZYIN 
ym 
Ns — ZIN) ye 
Or as 


= -Alu Sa (7.12) 
VNF — Nf) Sy 

To compute r,., by means of formula (7.12), the following values are needed: 
ZY — the sum of the N values of Y, 
Zy? = the sum of Squares of deviations for the N values of Y, 
Q, = the sum of the f., values of y Which are paired with X» and 


Vary 


Ka» = the number of values of X,. 
A product-moment coeffici 


ent of correlation computed by formula (7.12) is 
often called a point-biserial correlation coefficient. 
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There is also a special formula for the correlation between two variables, 
X and Y, when both variables are dichotomous. Let the variable X have two 
distinct values, Y, and Xə. Let the variable Y also have two distinct values, 
Y, and Y,. Then the bivariate frequency distribution will have the form of 
distribution A in Table 7.8. 

Both variables can be coded to values of zero and one as indicated below. 

Xt e Qn — I — Xx) = 0; Xj = (Xa — XDI: — X) = 1. 

Yi 2G — Y9/(Ys — Y) =9; Y, = (Y. — YDY: —- Y) = 1. 

Since coding does not affect correlation, rwy = "zy 

After the coding of both variables, the bivariate frequency distribution can 
be represented as distribution B in Table 7.8. . 

To compute ry we need the quantities: ZX’, 2X", ZX'Y', ZY', and 
BY" Since X! =0, X= 1, Y; — Gand Ya = b 

ZX' = faX! + fX: =f,0) +f) = fix, the frequency of values 
of Xa, 

Xy = f Xt + faX? =S + Fal? — fo 

ZY' = fa, and 


Ny" = fi. 
5 Table 7.8 


A BIVARIATE FREQUENCY DISTRIBUTION 
FOR TWO DICHOTOMOUS VARIABLES 


1 p 
N "t. fia + fa + fos 
B 
Xi X 


fa, fs 
N — fut fis + for + fn 


Consider next the sum of products of the coded variables. We observe that 
Z2Y'Y = fa X,Y! t fnXiYs 4 fasXLYA A fX SY — 0 t0 F 0 + fos. 
The sum of products, XXx'Y', is equal to fas, the frequency with which values 


of X, are paired with values of Ys. 
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p ZY’ and SY", and fos for 
ituti s for ZX' and ZX", f, for EL 
sory in ced for sums of squares and products of deviations yields 


2 


Xx =f; = 


"E 
2 Ji 
Ay =f, — NO 


and 


Safo 
2x'Y =f —. 


When these values are substituted in formula (7.11), the formula for rey 
becomes 


2 Soo ~SafolN 
"ex = Wa — FRING — FEIN) 
i " Nor — fofa (1.13) 


VN. — FEN. — £2) ka 
By substituting ( f, + fia + far + foe) for N, (fis + fos) for fs, vip te 
fez) for fy, in the numerator of formula (7.13), (fa + fa) for N in Pres 
factor and (f), + fe.) for N in the other factor of the denominator, an tes 
carrying out the multiplication and combining of terms, we can obtair 


computing formula, 
bey cain — fuf 
: Aff. fi fa. 
Inspection of formula (7.14) will reveal that, when both variables are 
dichotomous, the computation of the product-moment correlation coefficient 
requires only the frequencies for the four possible combinations of values, 


11 J22» Sia, and fa,, and the frequencies for each variable: f, and f, for X; 
fi. and f, for y. 


(7.14) 


Other Transformations ; Their Effect on 
Correlation 


A nonlinear transformation of X, such as 
Y =% 


may, on occasion, improve the prediction of a variable 
values of ¥ and 
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tion has the effect of producing a linear relation between Y and Z. That the 


Table 7.9 


PAIRED VALUES ILLUSTRATING 
A NONLINEAR RELATION 


X Z 
1 20 20 
2 .30 .30 
3 40 44 
4 .50 62 
5 .60 84 
6 410 1.10 , 
7 .80 1.40 
Table 7.10 


PAIRED VALUES FROM TABLE 7.9 
AFTER TRANSFORMATION 


y =X" Z 
1 .04 220 
2 .09 30 
3 16 44 
4 325 .62 
5 .36 .84 
6 .49 1.10 
3 64 1.40 


trend in the scatter-plot is rectified is apparent from an inspection of the plot 
of X and Z before transformation and the plot of Y and Z after transforma- 
tion. The plots are shown in Figure 7.5 and Figure 7.6. 


1.60 
° 
Z . 
. 
205 x 8 


d values from Table 7.9. Notice the curvilinear trend. 


Fig. 7.5 Plot of the paire 


such as 


A nonlinear transformation, 
Y= AX, 


154 Product Moments 
will also sometimes rectify a nonlinear trend and improve the prediction 


1.60 
. 
z . 
. 
29 4— 1— —— + + + 1 
C4 2 64 


Fig. 7.6 Plot of 
formation. 


the paired values from Table 7.10. Notice the trend is linear after trans- 
involving a simple linear e 


quation. Given the paired values of W and X 
shown in Table 7.11, for w 


hich the correlation is 0.987, the transformation, 


Table 7.11 


PAIRED VALUES BEFORE AND AFTER 
TRANSFORMATION OF ONE VARIABLE 


w X Y=yX 
1 2 4 2 
2 3 9 3 
3 4 16 4 
4 5 25 5 
5 6 36 6 
6 7 49 7 
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Y = A X, yields values of Y which are linearly related to W with a correlation 
of 1.00. Values of Y are also shown in Table 7.11. The plot of W and X 
before transformation is given in Figure 7.7; the plot of W and Y after 
transformation of X is given in Figure 7.8. 


1 
9 Y -vx 


Fig. 7.8 Plot of Wand Y for the values in Table 7.11. Note that the trend, after trans- 


formation of X to Y, is linear. 


Substitution of Ranks 


be used for computing the product-moment 
been substituted for the original values of the two 
variables, Y and Y. Given N pairs of values with N distinct values of X and N 
distinct values of Y, the first M ranks will be required in the substitution for 
- each variable. It follows that the variance of the ranks will be the same for 
both variables and will be the variance of the first N integers. 
In Chapter 4 it was stated that the first N integers have a mean of (N + 1/2 
and a variance of (N? — 1)/12. The formula for the product-moment coefficient 


computed from differences was earlier found to be 
_ Vet Vs — Va, 
ta = OVW 
ation between two sets of ranks, 
and Vp be the variance of the 


A special formula can 
coefficient after ranks have 


If we let Rey be the product-moment correl 
Vp be the variance of the first N integers, 


difference between paired ranks, then 
2Vr — Vp Vo ines Vp 1 6Vp 
War 1 — ys XN: — n NL. 


R= 
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If we let D = R, — R,, a difference between the two ranks in any pair, then 
— al y . 
the variance of the differences, Vp, will be defined as 


ID — Mp 
N 


where M p is the mean difference. Then 
2(R, — R,) ER, ZR, 
Mo ON N N 


since the means of the two sets o 
above, now becomes 


f ranks are equal. Vp, which was defined 
Ap 
Nc 


Substituting 2'D*/N for Vp in the formula for Rzy above yields the computing 
formula in final form: 


=i 6ZD — (1.15) 
Rey =1— NW? — 1) 


It should be apparent that com 


puting A,, for two sets of paired ranks, 
When there are no ties in rank 


is exactly equivalent to computing the eel 
moment correlation coefficient from the paired ranks. R,, is no more than 


Convenient computing formula for product-moment correlation involving two 
sets of ranks with no ties. 


Restrictions on the Magnitude of r, 


If the number of distinct values of y is not equal to the number of distinct 
values of Y, then 


and r,, cannot 
becomes 


the num 


That this restriction must nip 
predictions and their accuracy. I 
han the number of distinct values 


re dichotomous, t 
(7.14) which was derived earlier, 


È EE/TES — ifs 
ay = UTR. 


SaS afi fs, 


zy can be computed from formula 


Restriction on the Magnitude of ray 157 


The magnitude of ry is restricted when f.a 4 fz. in which case it will also be 
true that f, # fı. since 


fa Tf N —f. +f- 


Table 7.12- 


PAIRED VALUES ILLUSTRATING 
A RESTRICTION ON rzy 


X Y 
1 1 1 
2 1 2 
3 2 3 
4 2 4 
5 3 5 
6 3 6 
7 4 7 
8 4 8 

rzy = 0.976 


Examination of the 2 x 2 tables in Table 7.8 will reveal certain interesting 


relations among the frequencies. When f; < fe» then the largest value 
Possible for fz the value which will make rz, as large a positive value as 
possible, is fẹ, in which case fi; — 0 and fi, =fr Formula (7.14) can then be 


written 


on did = Sea (7.16) 
= A faf afi. fs. A fa fs. 


the maximum positive value of ray when f? < fe^ 


Formula (7.16) gives 


Exercise 7.1 


1. Compute the 10 possible covariances for the five variables given in Exercise 4.1. 


Example: 
ZV = 167 
SW = 1,723 
XVW = 100,712 
1,167)(1,723 
me x Ep De = (L16072 = 16,931.125. 
Vow = m = 705.46 


2. Compute bz», bzw Pex and bzy. Plot the equation for predicting Z from each 


of the other four variables. 


Example: 
Vow = 705.46 (from above) 
Vy = 835.40 (from the example in Exercise 4.1) 


V... 105.46 _ 
bo, — y, ~ 835.40 ~ 0:344 
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The prediction equation in terms of original values is 
W" = ba,V + (My — bu, M). 

The prediction equation written in terms of original values was derived earlier and 
presented as formula (7.8). To plot the equation, one selects three values for V 
and computes the corresponding values of W’. The resulting three pairs of values 
are then used as the coordinates of three points which can be plotted. A straight 
line can then be drawn through the three points. . 

We shall use, as three values for V, 10, 50, and 90. M, is equal to 48.62; My is 
equal to 71.79. The prediction constant, bw» is equal to 0.844. By substituting 
these values in the prediction equation given above, we obtain the three values of 
W^: 39.19, 72.95, and 106.71. Thus the coordinates of the three points are (10, 
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39.19), (50, 72.95), and (90, 106.71). The points have been plotted in Figure 7.9. 
134 
w 
30 
0 
» 100 
Fig.7.9 Plot of the prediction equation in the example of Exercise 7.1. 
3. Compute the 10 possible intercorrelati i iven in 
Exercise At, ions for the five variables given 
Example: 
ZV =1,167 
ZV? = 76,795 
ZW = 1,723 
ZW? = 143237 
ZVW = 100,712 
Formula (7.11) is y =A 
__ Zw 
w = a 
Ey = sy ZY — (1,167)? 
Vi — <P> = 76,795 p  — 20,049.62, 
Zw = zwi. (ZW) _ (1,723): 
w 143,237 24 E a 19,539.96. 


Zw = zyy _ ZVIW 


_ 1,167)(1,7 
100,712 — ( Y 23). 16,931.12. 
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Then 
16,931.12 


"ow = 00,049.62 109,539.96) — 0.855. 


A Look Ahead 


moments were introduced and certain 


special values, including the prediction constants and the coefficient of cor- 
relation, were discussed. The effects of coding on correlation were examined 
and some special formulas for product-moment coefficients were presented. 
Throughout the entire discussion of the present chapter, in every considera- 
tion of the predictive relation or of correlation, only two variables were 
involved. In Chapter 8, which is next, the methods of prediction and correla- 
tion are extended to more complex relations involving three or more variables. 
The three major topics of Chapter 8 are partial correlation, multiple predic- 


tion, and multiple correlation. 


In the present chapter, product 


PARTIAL AND MULTIPLE 
CORRELATION 


Syntactics | 8 


A partial correlation coefficient is the product-moment correlation of two 
sets of paired errors of prediction, For example, one might have the classifica- 
tion of values shown in Table 8.1 and carry out two sets of predictions: the 

Table 8.1 


A CLASSIFICATION OF VALUES 
IN N ROWS AND THREE COLUMNS 


Columns 

Ww X Y 
1 W, Xx Yi 
2 W, X: Y, 
3 W, X; Y; 

Rows P : 

i Wi X; Y; 
N Wy Xx YN 


t in two sets of paired errors of prediction: the 
errors, Q — byyw), in Predicting Y from W, and the errors, (x — byw), in 

k e would be N errors of prediction in each set and 
ired because two predictions would be made from 
d consequently two errors would be associated 


arranged as indicated in Table 8.2. 


We may now ask to what extent these two sets of errors are correlated. 
If we let raya be the Product-moment Coefficient for the two sets of errors of 
prediction, (x — 5. w) and O — bow), the desired product-moment formula 
can be written 

Fogg = oe = Baily — 
af SES bw) Sy 
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The possibility of expanding the expressions in this formula, simplifying 

them, and then substituting for the constants should be apparent. The 
Table 8.2 


A CLASSIFICATION OF TWO SETS OF ERRORS OF 
PREDICTION 
Columns 


Errors in predicting X from Ww Errors in predicting Y from W 


l (x1 — bam) On — byw) 

2 (xs — BewWs) (ya — BywWs) 

3 (xa — bewWa) O's — bywW) 
Rows : . 

i (x; — [m Wi — buwi) 

x (xy — bati) Ox — bus) 


derivation will not be given here but the convenient computing formula 


Which results is 
— F'aulyw. 


(8.1) 


lryw = 7] 


ree values which must be computed from 


Notice that formula (8.1) contains th 
ion coefficients for X, 


the original data: the three product-moment correlat 


Y, and W. 
The partial correlation coefficient, lxy.w: is the product-moment correlation 
between variability in X not predictable from W and variability in Y not 


predictable from W. The symbol, zu, is often read “the correlation between 


X and Y independent of W." 


Multiple Prediction with Deviations 


on equation, 
y = by, 


which was discussed in Chapter 7, involves ony, 
variable to be predicted and X, the variable from whi 
made. The prediction constant, Dyes is given a value, 
computed from the paired values of X and Y, an 
quantity, " 
y -=F = DO — b, 
prediction. 
n be formula 


The simple linear predicti 


nly two variables: Y, the 
ch predictions are to be 
Xxy| Ex, which can be 
d which minimizes the 


the sum of squares of errors of 


Linear prediction equations cà ted for more than two variables. 


2 Partial and Multiple Correlation 
16 


Suppose we were given a classification of the variables, A j: and r oe 
to the classification of Table 8.1, and wished to predict Z eee € 
simultaneously. For convenience, let us call Z the criterion wird 

and Y the predictor variables. The prediction equation can be wri 


z = bax + by 
where aM . 
z' =a predicted deviation for the criterion variable Z; 
x = an actual deviation for predictor variable X, 
Y = an actual deviation for predictor variable Y, 
b, = the prediction constant for variable X, and 
b a 


y = the prediction constant for variable X. 
Then an error of prediction can be expressed as 


G — z) = (z — b,x — by) 
and the sum of squares of errors of prediction can be expressed as 


I —zy-—X—bx- buy}, 
which is the quantity we will want to minimize by a proper choice of values 
for b, and b,. The procedure involved in applying the least squares criterion 
here is anal plicated than, the procedure in Ewe 
earlier applications: the demonstration that the sum of squares of deviations 
from the mean is smaller than the sum of squares of deviations from any other 
value, and the demonstration that the sum of squares of errors in predicting 


Y from X, when b,, = Zxy| Zx?, is smaller than the sum of squares of errors 
when b, 


ogous to, but more com 


not be given here in com 


ng that the magnitude o 
prediction, 


Z — bax — b,yy, 
depends on b, as indicated by the expression, 


(b Ex? — xz + b, Zxy)?, 


which must be €qual to or greater than 


Zero. The sum of squares of errors 
will have its smallest value when 
(b.Xx* — Exs + b,Zxy)? = 0, 
in which case 
b, 3x2 — Dye +b, xy = 0 
or 


be Ex? + b,Zxy = Xxz. (8.2) 


(8.2) contains the unknow. 
tly it cannot be solved. 


Note that equation 


n constant b, in addition to Ps 
and that consequen 
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The sum of squares of errors of prediction can also be written in a form 
which reveals its dependence on by, as indicated in the expression, 


(b,Ey? — Eyz + b.ExyY. 


which quantity must be equal to or greater than zero. The sum of squares of 


errors will, of course, be as small as possible when 

. (b, Sy? — Zyz + b,Sxy)? = 0, 
in which case 
b, Sy? — Zyz + b,Àxy = 9 


or 
b, xy + bZ) = Ayz. (8.3) 


Note that equation (8.3) contains two unknowns, b, and b,, as does equation 


(8.2). 
Equations (8.2) and (8.3) are r 
(a) bx + b,Exy = 2xz 
(b) be 2xy + b, dy? = 2yz 
and Syz, are values which can be 


ewritten below as (a) and (b). 


The quantities, Zx?, Z°, Axy, ZXZ, 
computed from the original classification of the three sets of numbers. To 
obtain values for the constants, b, and by this set of simultaneous equations 
in two unknowns must be solved. The student should examine a similar 


solution which was presented in Chapter 3. s m : 
The equations can be solved by eliminating by and obtaining a single 


equation containing one unknown, bz. The steps in the elimination are, as 
follows: 
Multiply (a) by Zy” to obtain 
(c) b,Xxtày* + p,ExyÀy = XxzÀy*- 
Multiply (b) by 2xy to obtain 
(d) — b,(Exy? + b Sxyly? = ExyÀyz. 
Subtract (d) from (c) to eliminate by and produce 
(e) p mezp — (oy = Day — Tarot 
Factor the left member of (e). 


b,[ 2x Ey! — (Exyl- ExzÀày — ExyEyz. 


S 
olve for bz. Brady — Mui (8.4) 
b. = Exedy — (2x? 


Formula (8.4) can be written 
nd Wu c Fan 
be = y, — Vii 
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i i i d (8.3), 
By means of a similar set of operations applied to equations (8.2) and ( 
one can eliminate 5, and show that 


AyzZx? — XxyXxz 
b s us 


xy 27 (8.5) 
"Exe Ly? — (Xxyy 
" — Vy V. — VV 
t Vly — VE, 


i the 
Formulas (8.4) and (8.5) can be used for the computation of b, and b,, 
two prediction constants in the linear prediction equation, 


Z -bu- b,y. 


Multiple Prediction with Standard Scores 


s ion, 
Consider the prediction of Z from X and Y by means of the linear equatio 
Zi = EZ. + B,Z,. 


> P res and 
Notice that the three variables have been transformed to standard scores 
that 


eck r 
the two Prediction constants are now denoted by the Greek letter B 
(“ beta”) with appropriate subscript, 


An error of Prediction is written 


(Ba — Z!) =(z, ~ B.Z. — B,Z,). 
The sum of Squares of errors of Prediction is 
AZ, — Zi} = xz — B.Z.— BZ 
quantity to be minimized. Applying the least squares afitepion 
values of B, and B, yields the set of simultaneous equations: 
PZZ} + B,Xz,z, = ZZZ, 
B.XZZ,-- B, EZ? 
Dividing each equation by N yields 


P, 
p 
Which is the 


for Obtaining 


ll 


b P 


B.XZ3N + B,XZ,Z,JN = SZ ZIN, 
B.XZ.z 


JN + B,XZ*N = ZZ,ZN. 
Since ZZ?/N = Zz? 


N=1, EZ,Z, IN = "ew ZZ, Z.[N = p, and XZ,Z.|N 
= "ys, Substitutions can be made and the equations can be written 


B. + Byr 
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The equations contain two unknowns: the prediction constants, 8, and By. 
The known values in the equations are the product-moment coefficients: rey, 
r,, and r,, which, in any specific problem, would either be given or be 
computed from the original classification of the three sets of numbers. 

One might solve for B, and B, by the same methods employed in the 
preceding section in solving for b, and b,. The procedure for eliminating one 
unknown, f, and the resulting formula for the other unknown, fs, are 


presented below. 
Given the equations, 


(à) — Be + Bye = res and 
(b) Barsy + By = Tus 
the steps in eliminating fy and solving for B. are, as follows: 
Multiply (b) by rz, to obtain 
(c) Bat2y + Boley = lulu 
Subtract (c) from (a) and obtain 


(d) Bx RES Bary = lez — Tays 
Solve for B, by factoring on the left and dividing by the coefficient of Bs. 


Bell. — rèy) = Fas — lah 


Fa; — Vey yz (8.6) 
Bs = j-r E 
By similar operations We could show that 
vs reru (8.7) 
By 1 = rë i 


nvolved and, consequently, only 
s must be determined, then the elimination of one of 
the unknowns by appropriate arithmetic operations which visis a single 
equation in one unknown is not at all difficult. When the prediction on 
involves three or more predictor variables and a corresponding number o 
prediction constants, these fundamental arithmetic scc nme. at 
lengthy and tedious, are not usually employed and formulas z Re pin 
are not generally available. be eie or more un ^ 
i oaches are wi . mE . 

greet which is appropriate for computation involving 


desk calculators, the detailed computing instructions devised. in 1878 by 
Doolittle, an engineer with the United States Coast and Geodetic Survey, or 
modifications of his instructions, are employed. These detailed instructions 
cleverly take advantage of certain standard features of Los oio 
equations in multiple prediction problems. Repetition in notation is reduce 


Where only two predictor variables are i 


two prediction constant 
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to a minimum and the entire computation is condensed and given a compact 
tabular representation. Even with the routinization and condensations of 
these procedures, however, the solution for three or more unknowns is still 
burdensome. For this reason, the Doolittle instructions will not be repro- 
duced here. The student who may require these detailed computing instruc- 
tions for a multiple prediction problem involving three or more unknowns 
can find them in a number of texts including those by Guilford (1, 406-409), 
Johnson (2, 330-337), and Kenney and Keeping (3, 298-301). 

In the newer approach, one formulates the fundamental solution in terms 
of matrix algebra and has the burdensome computation done on an electronic 
computer. To demonstrate the nature of the computer solution, we shall 
employ the simple problem involving two predictors and two unknown pre- 
diction constants. The student should understand that one would not put 
such a simple problem on a computer. The simple problem is used here 
simply for convenience in introducing the student to matrix notation. 


Matrix Notation 


For Purposes of discussing matrix notation, we shall choose the problem 
of predicti i 


cung variable Z from variables X and Y, when all three have been 


and products, and the unknowns, 5, and by, follows the same principles. 


equations for the problem in Which Z, is to be predicted 
z and Z, have been shown to be 


Bs + By. = res, and 


Bar, TB, A 


It is interesting to note th. , 
v at these various iti d 
displayed in t à quantities can be separated an 


can be displayed 
5, a 2 X 2 matrix. Notice that the 


vz: 
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Assuming that you were given the task of computing the correlations, rsz 
and r,. in the 2 x 1 matrix on the right, from the information in the other 
two matrices, what system would you use? One system, which has now 
become a standard procedure, is to take the entries from the tables and com- 
pute with them according to the instructions contained in the equations 
themselves. In the first equation, 


Bz + Bury = zs 


one is instructed to obtain the product, B.(1), and the product, B,r.,, and 
to sum these products, obtaining r,.. In the second equation, 


Pitay + By = yz 
one is instructed to obtain the product, B,rzy, and the product, B,(1), and to 
sum these products, obtaining 7: 

Examine closely how the operation of summing products relates to the 
arrangement of entries in the tables. It amounts to pairing entries in a row 
of the matrix at the left with entries in the column of the middle matrix, 
obtaining the product for each pair, and summing these products to obtain 
an entry in the matrix at the right. This operation of summing products of 
row and column entries in two tables is called matrix multiplication. The 
operation is expressible in a very compact notation when there is no need to 
show the tables themselves and the values they contain. For example, if we 
call the matrix on the left, P, because it contains the correlation between the 
two predictor variables, and call the matrix in the middle, B, for the B-values 
in it, and call the matrix on the right, C, because it contains the correlations 
with the criterion variable Z, then the entire matrix multiplication can be 
represented as 
PB=C. 

The student may find himself at a loss, initially, to react to such aa 
notation. It may help to suggest that, in looking at PB = C, he € i say 
to himself, “ P is a table of numbers; B is another table of num m i: am 
instructed to pair the entries in each row of P with the entries ee mm 
of B, to obtain the product for each pair of values, and to sum t s pro E n 
Carrying out this operation for a given row of P ES e column o 
yields a sum of products which is one entry in a cell of tal : ^ ovre 

All that we have done, up to this point, 1s explore, in a limite as hion, d 
matrix representation and notation for the simultaneous rna ^ 
multiple prediction problem. We have shown that, given P and B, Ls qo 
obtain C, but finding C is not the problem we ordinarily have to solve. The 


problem we actually have can be stated, as follows: Given the correlations, 

Tey Fan, and ryn compute B. and B,. That is, given P e be La T 
Let us examine the two formulas for Bz and y, (8.6) an (8.7), and we sha 

see that, when these two equations are written in matrix form, the computation 
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does then correspond to the problem we are interested in solving. Formulas 


(8.6) and (8.7) are given below, with the direction of the equality statement 
reversed. 


ly: =a Fr yl yz 


— ri, 
Seas + Fyz B 
l=, 


The two equations can be rewritten with two terms in the left member of 
each, both terms having a common denominator, as follows: 


The equations can be rewritten a 


gain to emphasize the entries for matrix C, 
"ez and ry, and the coefficients o 


f these values, as follows: 


l! Tay 
1 — 73, | "= + ~ T= ee |" = Bes 
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Pia uoi sque for values of B, and B, can now be expressed in 
PAC = B. 

In reacting to this symbolism, it is suggested that the student say to himself, 

po (P inverse) is a table of values computed from P, another table cone 
taining the correlation between the two predictor variables. C is a table con- 
taining the correlations between the predictor variables and the criterion 
variable. I am instructed to pair the entries in a row of P- with the entries 
in a column of C, to multiply the paired entries, and to sum these products 
obtaining a prediction constant to be entered in table B.” 

All that has been said above about the simple prediction problem involving 
a total of three variables holds for larger problems. Given a specific problem, 
the student would construct two matrices: P, a table of intercorrelations for 
the predictor variables, with l's as the diagonal entries, and C, a table of 
correlations between the predictor variables and the criterion. He would take 
matrix P to a computation center and ask to have the matrix inverted.? The 
computer service would return to him the inverse matrix, P7. The student 


could then perform the matrix multiplication, 
P3C = B, 
to obtain the f’s.* 


The Prediction Equations 


d wishes to employ prediction equations 


If one computes B, and fy, an 
btainable by the formulas: 


involving 5, and b,, the latter values are o 
S: 
b, = Bs S. (8.8) 


and 
S. 
b, —Bvs- 8.9 
y — Bv Sy (8.9) 
ation for deviations, 
z! = b,x + by 
X, Y, and Z, as follows: 


The prediction equ 


can be written in terms of original values, 
(Z' — M) = b(X — MJ + b(Y — My) 


?Programs for matrix inversion are found in all computer libraries. 

"It is possible, of course, to have the entire computation done by the computer service. 
Entries in the original matrix of intercorrelations for all variables, a combination of P and 
C, could be obtained on the computer, and the final multiplication, P-C, could also be 
done on the computer. The procedure was presented above, broken down into its major 
phases, so that the student's understanding and appreciation of it might be increased. 
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1 
or 


Z' = b,X + b,Y — b,M, — b,M, + M.. 


Multiple Correlation 


i i ct- 
The coefficient of multiple correlation, R...,, is defined as r.., the produ 


i Z. 
moment correlation between actual values of Z and predicted values of 
It is computed by the formula, 


Ez dha .10 
Ras = d Eras + Buys (6.10) 
RÈ.» is the Proportion of variability in Z predicted, simultaneously, p 
variability in ¥ and y. The quantity, (1 — RÈ =), is the proportio 
variability in Z not predicted from variability in Y and Y. 
Exercise 8.1 h 
1. Compute "zy. us the correlation between X and Y independent of W for eac 


rn 
of the correlation matrices given in Table 8.3. Use formula (8.1). Study the patte 
of relations in each problem. 


Example: 
Given ry, = —0.50, rwy = 0.50, and Fey = 0.40, then 

Lo Fay — wr! wy 40 — (~.50)(.50)_ _ 
VO — Fad = Fig) 7 WU — 35 Yi— 35j 

2. Compute the multiple correlation coeffi 


Use formulas (8.6) and (8.7) to compute 
formula (8, 10) to compute Beas: 


Fay. = 


0.867. 


cient, R; «y for the data in Exercise 4. 
the beta-coefficients, 8, and £y, an 


Table 8.3 
FOUR 3 x 3 CORRELATION MATRICES 


Ww xX Y 
"eT aE 
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Compute th i 

e coefficients, b, and 
the predicti ients, b, an by, by means of formulas (8.8) and i 
ties eon equation involving the original values of X, Y, Le Z, a sei" 
, and b, substituted in the equation die: , with computed 


Example: 

Gi -— 
"s. — US. 0.30, rz: = 0.50, and ryz 0.60; M. 
: ; Sz = 10, S, = 10, and S, = 20, then E 


50, M, 60, and 


fas — refs _ 50 — (30.60) 

B= Ta, 1-3 “08% 
Fyz — Mewes .60 — (-30)(.50) 

p, = gts = oe 0.494, 


and 
— E ; 
nay = A Bares + Byry: 
The coefficients, bẹ and by, 


= (352550) + (494)(60) = 0.687. 


are computed, as follows: 


b, = B. T- 35115 = 0.704, 
and z 
b 0 
y= bys = 49416 = 0.988 
Table 8.4 


MATRICES P, P^, AND C FOR VARIABLES IN TABLE 4.4 


P 
4 wW X Y 
0.232095 | —0.186350 y 


P 1.000000 0.855403 

d 0.855403 1.000000 | —0.1 70851 | — 0.084012 w | 0.536812 
X | —0.232095 | —0.170851 1.000000 0.833802 | X | 0.637972 
Y | —0.186350 | —0.084012 1.000000 | Y | 0537620 


P> 


—0.113690 0.546268 


3.924057 —3.330181 


— 0.663428 


0.441149 


— 3.330181 3.868282 


—2.817722 


3.398407 


— 0.113690 0.441149 


3.395484 


—2.817722 


0.546268 — 0.663428 
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The prediction equation can then be written 


Z' = bX + b,Y — b,M, b,M, + M; 


or 


Z' = 0.704X + 0.988 Y — (0.704)(50) — (0.988)(60) + 100 
or 


Z' = 0.704X + 0.988 Y + 5.52, 


3. Table 8.4 gives the P matrix for variables V, W, Y, and Y in Table A 
Exercise 4.1, and P^, the inverse of P. Both matrices were computed on the IBM ion 
in the Computation Center of Pennsylvania State University. (Actual computati - 
time for the inverse was less than one minute.) Matrix C, which contains the co 
relations with the criterion variable, Z, is also given in Table 8.4. 


Perform the matrix multiplication, 
PÜSC B: 


to obtain the matrix of coefficients: Bes By, Br, and B,. Also compute Resen ue 
bws by, and by. Write the Prediction equation using V, W, X, and Y, an 
computed coefficients, The formula for By ies ds 
R., vwry = Boros + Balu F Baz + Byrys. 
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PRODUCT MOMENTS IN DESCRIPTION 
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The following discussion of product moments, and the various d-statistics 


computed from product moments, will be based on their logical properties, 


as these were presented in Chapters 7 and 8, and will not entail consideration 
of the error to which these values might be subject in any formal statistical 
sense. As stated earlier, it is our purpose to make and maintain thedistinction 
between description and inference, as clearly as it is possible to do so. The 
descriptive role of product moments and values computed from them will be 
Outlined in the present chapter. The error properties of certain i-statistics 
computed from product moments will be presented in a later chapter. 
Although there is no operation of more fundamental importance in research 
analysis than the specification of a relation between two variables by means 
of an equation, the wide applicability and general desirability of this operation 
are seldom understood and appreciated by the beginning student in his typical 
approach to a research problem. Too often he fails to see the potential and 
the relevance of a function and will, as a consequence, try to assess his results 
by some crude and inferior method of his own devising. The beginner's own 
inventions are usually inadequate not only because they fail to capitalize on 
functional relations, but also because they often result in the discarding of 
valuable information. Two common practices which betray a lack of apprecia- 
tion of the role of functions and an inadequate conceptualization of research 
hypotheses are dichotomization of variables and selection of extreme values 
from distributions. One of the student's first goals in preparing to become à 
researcher should be the development of an understanding and appreciation 


of the great variety of situations in which obtaining à function to represent a 
relation is not only possible, but also highly desirable, as a Way of handling 
data. The student researcher's first though in planning and evaluating research 
should be concerned with the possibility of a functional analysis. 


It should be clearly understood, at the outset, that à function can be 
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i tions 
established for the relation between two variables only when = EE 
on the two variables are paired. How observations becom par eie 
obvious to the experienced researcher, but is often a rose o sedie ae 
student. Competence in research depends toa considera Sai oci et 
capacity of the researcher to detect or discern possibilities foro eec pn 
observations. Of course, this pairing of observations is never th von, The 
arbitrary decisions and arrangements on the part of the ee s » 
pairing is dictated by the observational procedures of the researc a one 
more specific, the investigator obtains a pair of observations, one [Xem 
variable, for a single object or a single event. Thus data consisting o A m 
of observations imply the measurement of N objects or events on — da 
variables. Consequently, this pairing, being the circumstance which ma i 
expression of a relation between variables meaningful, must be held inviolate. 


: A ; z ich 
To convey to the student some idea of the wide range of aiii 
yield paired observations and therefore offer possibilities for functional in 


; ; ; of 
pretation and Tepresentation, we shall list below six broad classes or def 
variables which are of continuing interest to the research psychologist. má 
each class a few specific examples of variables are named but these are 


Siga Dre a : ; s- 
described in detail with respect to the particular operations which are nece: 
sary and appropriate for obtainin 


^ e: a " 1 rea 
the student should consult texts or articles specializing in the particular a 
in question, The listing is not ex 


absolutes in any sense, Fo 


sige : à i ired 
number of combinations of variables which could be the basis for pai 
Observations and whi i 


arch. They are simply intended to enlarge the student's 

concept of functi 

Classes of vari " 

1. Data on t nt of a subject, including 
ic pressure, humidity, rainfall, altitude, 
ommunity, size of school, size of family, 
Per capita expenditure for education in the state. . 
2. Biographical data, including age, grade in School, amount of education, 
salary, number of Years married, and number of children. 


s, including height, weight, size of waist, and size of 
foot. 


of water. 


5. Series, including Successive time periods, trials, and ordinal positions of 
Successive events, 
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6. Behavioral measures, including 

a. physical measures of responses or products, such as length of standing 
broad jump, distance of discus throw, weight lifted, time required to run 
100 yards, latency of response, and area covered by a drawing; 

b. physical measures of a stimulus under standard conditions for count- 
ing discrimination responses, including intensity of a tone, voltage of a light 
source, concentration of salt in solution, frequency of a tone, and wave 
length of a light source; 

c. frequency counts of responses, including number of correct items, 
number of self-references, number of favorable statements endorsed, and 
number of “ liked ” activities; 

d. judgments of response attributes, including rank given by an expert 
judge and average rating given by three observers; 

e. measures of agreement between two sets of rankings of a number of 
items by a single individual. 

To facilitate the outlining of possibilities for functional representation, we 
shall consistently refer to the two variables in each example as X and Y, and 
shall use N as the number of pairs of observations. In every case, the 2N 
Observations could be displayed in a measurement matrix of N rows and two 
columns as shown in Table 9.1 and could be represented graphically by means 


Table 9.1 


GENERAL FORM OF A MEASUREMENT 
MATRIX FOR TWO VARIABLES 


Variables 
X Y 


Pairs of 
Observations 


Of a scatter-plot as indicated, in general form, by Figure 9.1. The coordinates 
of any point in the scatter-plot are the pair of values: X; and Y;. Graphically, 
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the function relating X and Y will be a straight line or a curve of best fit for 
the points in Figure 9.1. 


Plot of points: 
£ [t We (3e QUY) Gk Qu Y.) 


x 


Fig. 9.1 General form of the Scatter-| 


plot for the paired measures in Table 9.1. N 
number of pairs of measures — number o! 


f points in the plot. 


Typical combinations of variables whose relation might be evaluated by a 
nction given by a mathematical equation are listed below. . 
1. X = score on a science interest inventory; Y — score on a science 
achievement test; M = number of Subjects taking both tests. . 

2. X — number of years of formal education; Y — score on an attitude 
scale; N = number of subjects with attitude scale Scores. 

3. X — hours of food deprivation; Y 
terms of the number of revolutions of an 
of successive hourly periods of food depri 

4. X — the ordinal number of a trial; 
N — total number of. trials allowed an animal. 


5. X — the rank assigned to an item by a subject in describing his father; 
Y — the rank assigned to an item by a subject in describing himself; N = 
total number of items. 

6 X= physical measure of the intensity of a stimulus; Y = percentage 


of correct discriminations; N — number of different stimulus intensities used 
with a single subject. 


7. X — height of athlete; Y — distance of running broad jump; N — 
number of athletes. 


8. X = age of a salesman; 
period of time; N — number of salesmen. 
9. X — a student's reported avera 


fu 


= amount of activity measured in 
activity wheel; N = total number 
vation for a single animal. 

Y = latency of a running response; 


Y = amount of commissions for a specified 


ge amount of time per week spent in 
study; Y = a student's final average college grade; N = number of students. 


10. X — a teacher's rank in teaching ability as judged by an expert; Y — a 
teacher's salary; N — number of teachers, 


11. X = number of cups of coffee drunk daily; Y — amount of carbo- 
hydrate in daily diet; N — total number of days over which a single subject 
was observed. 
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Choice of an Equation 


In a strictly theoretical sense, the research psychologist has an unlimited 
number of different equations, from which he might choose one for the 
interpretation and representation of any given set of data. In practice, his 
Choice most often turns out to be the simple, linear equation: 


y —b,x, 


With its associated prediction constant, bys, and product-moment correlation 
coefficient, r.,. The prediction constant, b,,, indicates the number of units 
change in Y per unit change in X for the predicted values of Y, that is, change 
as a point moves along the best-fitting straight line. Since the units of many 
psychological variables are not invariant from one test or instrument to 
another, psychologists tend to report the results of their research more often 
in terms of the dimensionless correlation coefficients than in terms of predic- 
tion constants. Correlation coefficients are also employed when there is no 
desire on the part of the investigator to focus upon the direction of the 
predictive relation, as one does in using bys or by. 

Three main factors appear to account for the psychologist's frequent 
Choice of the simple, linear prediction equation. The first factor is the con- 
venience which arises mainly from the ease and simplicity of computation, 
graphical representation, and communication. The second factor is the 
Beneral background of experience in psychological research, known and 
Shared by most active researchers, which background suggests that linear 
relations are to be found much more frequently than are nonlinear ones. The 
third factor is that linear functions are often useful approximations to a rela- 
tion, even when the true function is curvilinear. It would be incorrect, of 
Course, to convey the impression here that all relations between variables, in 
Psychological research, are necessarily linear or even approximately so. The 
researcher must be ever on the alert for evidence of nonlinearity.! The student 
Should also be reminded that, because all relations are not necessarily linear, 
it does not follow that they are necessarily curvilinear. The nature of the 
relation between two variables is, in the final analysis, a matter of empirical 


determination. 


Curvilinear Prediction and Correlation 


There are countless nonlinear equations which might be employed for the 


1The simplest check for nonlinearity is an inspection of the trend revealed in the scatter- 
plot. 
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relation between two variables when a linear equation is not a sufficiently 
good approximation, but multiple-linear prediction and correlation, dis- 
cussed in Chapter 8, provide the basis for a very practical solution to the 
problem. 

To structure the problem and outline its solution, let us say that the 
criterion variable, Z, is to be predicted from predictor variable, X, but there 
is evidence of a moderate departure from linearity. To allow for this departure, 
we may manufacture a second predictor variable, Y, simply by transforming 


X, as indicated by the formula, 
F= WE 
We now propose to predict Z from X and Y, simultaneously, in the equation, 


Z' = b,x + byy, 
where x = (X — M,) = X — ZX/N and y 2 (X? — Mz) = X? — ZX*JN. 
Predicting from X and X? amounts to solving for constants in a quadratic 


equation and to the fitting of a parabola to the scatter-plot for Z and X. The 
N X 3 measurement matrix is shown in Table 9.2. Keep in mind that 


Table 9.2 


MEASUREMENT MATRIX FOR 
CURVILINEAR PREDICTION 


= 
ES 
N 


Subjects or . " Jj 
Occasions i Xi Y; Zi 


N Xy Yy Zy 
measures of Z and X are directly the result of observation, but that values in 


the Y column are simply the computed squares of X. The form of the scatter- 


plot for Z and Y is suggested in Fi i i i 
l i gure 9.2. One could im ts 
displaying a trend for which a curve, perpe 


i and not a straight line, gave the best 
fit. In the case of a simple, linear prediction equation, the coordinates of 
points lying on the line of prediction would be (Z', X) where 

Z = b,X — b,M, + M.. 
In the case of curvilinear 


a Prediction suggested above, th i f 
points lying on the curve would be (Z', X), where AS a 


Z' = bX + b,Y — b, M, — b,M, + M.. 
Do not forget that Y — X? and M, = ZX?*|N. 
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To solve for b, and b,, we might first solve for B. and B, following the 
procedures outlined in Chapter 8. The computation would involve obtaining 


Plot cf points: 


Zune. ) (XZ) C. 4 (x, Z,) 


x 


Fig. 9.2 The scatter-plot for Z and X would consist of points whose coordinates were 
Q6, Z), Gs ds (Xs Zi, C De OG Zi pairs of values from Table 9.2. The coordinates 


of any point lying on the best-fitting curve would be (.X;, Z;) where 
Z'= bX + Br b,N, — byMy + M; 
and 
y= x?. 


Fan Fez Fyz Sys Sy, and Sz Values could then be substituted in formulas (8.6) 
and (8.7) to compute B; and B. Formulas (8.8) and (8.9) could then be used 
to obtain b, and b,. It would also be necessary to compute M,, My, and M, 


for the prediction equation. 


Partial Correlation 


Given two variables, X and Y, having a substantial correlation, it is entirely 


Possible that the observed relation between the two can be accounted for, 
wholly or in part, in terms of a third variable, W. That is to say, if we remove 
from X and Y the variability which can, in each case, be predicted from W, 
the correlation remaining between X and Y may be reduced considerably in 
Magnitude. The correlation between X and Y after variability predicted from 
W has been removed is the partial correlation coefficient, rey.w We have 
Suggested that ry.» might be, but is not necessarily, less than ey. An example 
'S given below. 


An instructor of an introductory psychology class found the correlation 


between two successive course tests to be 0.70. He suspected that this con- 
Sistency of performance in the course could be accounted for by differences 
In the ability of students as it could be measured by a precourse test of verbal 
ability, He decided to compute the partial correlation coefficient for ‘the 
Course tests independent of the verbal ability test as a check on his explana- 
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tion of the consistency in performance. The correlation between the verbal 
ability test and the first course test was 0.30. The correlation between verbal 
ability and the second course test was 0.40. He used formula (8.1), 


Yy 
y — rẹ — 
where variable W was scores on the verbal test, X was scores on the first 
course test, and Y scores on the second course test. He obtained a value of 
0.66 for rey... The result supported his idea that the relation between the two 
Course tests was, in part, accounted for by their relations with the verbal test, 
but indicated that a fairly high degree of consistency in performance existed, 
independent of verbal ability. It is possible that the test of verbal ability was 
a poor predictor of course performance because it was a poor measure of 
verbal ability, but it is also possible that other factors such as differences in 


motivation and interest accounted for the consistent differences among 
students. 


Fey. = 


Practical Prediction for Selection 


Prediction equations are used widely in industry, education, government, 
and the military for the selection of personnel. The general procedure in a 
selection program involves the following steps: (1) choosing a criterion 


variable and one or more predictor variables; (2) measuring applicants or 
candidates on the predictor variabl 


the criterion; (4) doin 
tion equation; (5) 
Predictors only; (6 
performances on th 
the new group ont 

The criterion va 


productivity. The measure ma 
: y be a grade, a test i rt 
or a rating by a s grade, Score, an index of some sort, 


: à predictors. The criterion measure may 
| same time as the predict i it i 
dpi predictor variables, but it is often 


There is a major problem i i i 
rer à n this selection procedure. Noti the 
prediction equation is established on one : A yaoi 


Ict group and applied to another for 
prédiction purposes. Data for the original group consist of measures on the 
predictor variables and the criterion variable. From these data, the constants 
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for the prediction equation can be computed. Data for the new group consist 
only of measures on the predictors. The criterion performance of an indi- 
vidual in the new group is predicted by substituting his predictor score(s) in 
the equation. One of the uncertainties as to the success of practical prediction 
relates to the question of whether or not the equation applies to the new 
group. Judging from experience accumulated by psychologists, there is little 
doubt that it is sometimes applicable or appropriate, being more so on some 
occasions than on others. The appropriateness of the equation for the new 
group depends on the response similarity of that group to the original one 
on which the equation was developed. 

Another problem having to do with the appropriateness of the equation 
arises from the selection which may have taken place in the original group. 
Very seldom does an industry, an educational institution, or a government 
agency employ or accept any and all applicants. In practice there is almost 
always some screening and some rejecting. It is evident then that the predic- 
tion equation is established on a group which has gone through selection. 
The new group, to which the equation is applied, however, is usually an un- 
selected group. Ideally, the equation would be established on an unselected 
group. The effect of this departure from the ideal can be expected to be a 
reduction in the accuracy of prediction. 


Test Construction 


Product-moment correlation has important applications in the work of 
Constructing and revising psychological tests, scales, inventories, and question- 
naires, The reliability and the validity of the psychologist’s measures are often 
expressed in terms of product-moment coefficients. Item analysis and item 
Selection procedures also involve extensive applications of product-moment 
Correlation. 

Determining reliability is the psychologist’s way of assessing reproducibility. 
There are three methods in common usage. The psychologist may administer 
a test on two different occasions to the same group of subjects. The correla- 
tion between the first administration and the second is a measure of fest- 
retest reliability. Having administered a test on one occasion to a group of 
subjects, he may divide the test into two halves. A score is determined for 
each subject on each half. The correlation of one half with the other is a 
Measure of split-half reliability. Finally, he may construct two equivalent or 
parallel forms of a test and administer both forms to the same group of 
Subjects. The correlation of one form with the other is a measure of 
equivalent-forms reliability. 

It can be shown that a coefficient of reliability compares interindividual 
differences with intraindividual differences. If we express an individual’s 
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standing on a test as a deviation from the mean of his group, then his average 
standing for two administrations can be computed by adding the two devia- 
tions and dividing the sum by two. Differences among these average standings 
for the individuals in a group might be called interindividual differences. If 
we measure the variability of the individual's two standings around his 
average standing and combine these measures for all subjects, the result 
might be called a measure of intraindividual differences. When intraindividual 
differences are small, one can say that the measures are reproducible. When 
intraindividual differences are large, the measures are lacking in reproduci- 
bility. Of course, these differences are “ small " or “ large " relative to some 
standard. In expressing reliability by means of a correlation coefficient, the 
psychologist compares the two kinds of differences? Thus intraindividual 
differences are small or large by comparison with interindividual differences. 


Small, when the reliability coefficient is large. Large, when the reliability 
coefficient is small. 


Validity has a number of different meanings for the psychologist. Here we 
shall use the term to refer to concurrent validity and predictive validity, both 
of which are often assessed by correlating a predictor variable and a criterion 
variable. Concurrent validity involves criterion measures obtained at the 
Same time as the predictor measures. Predictive validity involves criterion 
measures obtained some time after the predictor measures. Computing the 


product-moment coefficient of correlation for a test and a criterion is 


the most common method of demonstrating the predictiveness of the 
test. 


The student should review the discussion of the functional approach to 
confirmation in Chapter 5 and integrate that discussion with the comments 
on reliability and validity made above. 

Many psychological tests and s 
dichotomous variables. Subj 
two categories, 
affirmative and 
in one category 
Thus an indivi 


cales consist of items which are scored as 


» With scores on the test containing 


m, and with measures on some criterion. The complex of issues sur- 
rounding item selection based on these corri 


-test correlation or an item- 


1 (7.13) could be used to compute the 
correlation between two items. 


"The logical property of product-moment correlati i - ? 
is given in Chapter 10. ion, which supports this interpretation, 
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Closing Comment 


In the frequent practical use of prediction equations and in the common 
listing of correlation coefficients in research reports, there are implicit two 
fundamental notions which deserve considerable emphasis in the training of 
students. The first is that prediction and correlation actually constitute a 
functional interpretation of relations among variables. The concept of a 
function is central to the entire development. The second notion is that func- 
tional interpretation through prediction and correlation involves the fitting 
of lines and curves to data which often display much irregularity. Even though 
the irregularity makes fitting necessary, the function concept is still there. It 
is fundamental. 


THE ANALYSIS OF VARIABILITY 


Syntactics | 10 


Classification procedures play an important part in statistical analysis. Two 
kinds of classifications can be distinguished. One kind, the frequency classi- 
fication, includes univariate frequency distributions and bivariate frequency 
distributions, which have been discussed in earlier chapters. In a univariate 
frequency distribution, a set of numbers is represented by categories which 
correspond to intervals on a quantitative scale and by frequencies which 
indicate how many numbers in the set fall into each category on the scale. 
In a bivariate frequency distribution, two sets of paired numbers are repre- 
sented by categories on two scales and by frequencies which indicate how 
many pairs of numbers are in each possible combination of categories, one 


from each scale. Later, frequency classifications in which the categories are 
qualitatively distinguishable will be considered, 


fairly simple 
variability appropriate to each one. 


Classification by Columns 


Numbers in a set can be classified 
entire set can be analyzed into two co. 


184 


into subsets and the variability of the 
mponents, 
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Let 


n = number of values per subset, a constant; 
c = number of subsets; 

N = cn = total number of values in the set; 
Y; = the i'^ value in the kt® subset. 


If we let a subset be indicated by a column in a rectangular arrangement of 
the N numbers, the general form of the arrangement is that of Table 10.1. 


Table 10.1 
NUMBERS CLASSIFIED BY COLUMNS 


Columns 
1 2 3 k c 
Yu Yi: Yis Yi Vie 
Ya Yoo Yas Yu Ys 
Yu Ya: Yas Yok Ysc 
Ya Yis Yi E] Yi k Yie 
Yn Yn: Yna : Ynk . Yne 


Note: There are no rows in the table and the numbers are not, therefore, classified by 
rows. 


The sum of the n values in the kt" subset or column can be written 
n 
2 Ya. 
i 


Notice that the index, i, governs the summation, which involves terms from 
" , m nhi 

1” to “n” within any column. 

Y, the mean of the numbers in the K^ column, is given by the formula, 


bg 
Y. — r 


The sum of all N values can be written as the sum of the c column sums, 


c n 


È E Ya 
k i 


The purpose of the double subscript should now be apparent. The index, i, 
governs the inner sum which is restricted to the values within a column. The 
index, k, governs the outer sum which contains the c column sums. It will be 
convenient to use the symbol T, for the sum of all N values. Then 

n 


c 
T, — the sum of the column sums — ZZ Yipo 


k id 


186 
Let M, = the mean of all N values. Then 
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Ty 
M, = N 
The variability in the entire set of numbers can be measured by the sum of 
squares of deviations from the total mean, M,. The sum of squares of devia- 
tions from M, for the numbers in the kt" column is 


E (Yn — My. 


Then the sum of squares of deviations from M, for all c columns is 


con 

2 E (Ya — My, 

k i a 
which quantity will be called the total sum of squares or the total variability. 


If we expand and simplify this expression, making appropriate substitutions 
as well, a computing formula can be obtained. The formula is 


c n 2 
PP AEA 
ki 
Let C, = T}/N. Then the formula for the total sum of squares can be written 
REE: POA (10.1) 
k i 


Example: 


s mn sums, means, and sums of squares 
are given at the bottom of the rectangular arrangement. 


Table 10.2 


AN EXAMPLE OF NUMBERS CLASSIFIED BY. COLUMNS 
Columns 


Sums 35 40 60 
Means 7 8 12 
Sums of squares 259 330 738 
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Computation of the total sum of squares requires the sum of the N values 
and the sum of their squares. We find that 
T, — 135 
and 
$2EY,-137 
kd 
Having found T,, we can compute C,, the second term in formula (10.1), 


as follows: 
I? 135: 
GS N B” 1,215. 
The total sum of squares of deviations is found by the subtraction, 


1,327 — 1,215 — 112. 


The total sum of squares can be subdivided into two additive components. 
The first of these components we shall call the combined within-column sum 
of squares. We proceed below with the definition and computation of this 
component. 

Consider the kt" column. The sum of squares of deviations from the mean 
of the kth column is 


2 (Ya — Ya}, 
i 


a measure of the variability within that column. Formula (4.2), which was 
derived earlier for computing the sum of squares of deviations from the 
mean for a set of numbers, can be rewritten for a column sum of squares, 
with some slight modifications in the symbolism, as follows: 


(2 Ya)? 


Sy oi . 
"ulia m 


Example: 
The sum of squares of deviations for the first column of Table 10.2 is 


computed, as follows: 


35* 
239 a = 14. 
The sum of squares for the second column is 
40° 
330 — = = 10. 


For the third column, it is 


60? 
738 ae i 18. 
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Each of the three quantities computed above is a measure of the variability 
within a particular column. The three values can be added, yielding the com- 


bined within-column sum of squares, an overall measure of the within-column 
variability, as indicated by the expression, 


& Pur, = vj 
k 


~ bys 


and by the computation, 
14+ 10 + 18 = 42, 

Observe that the combined within- 
less than the total sum of squares, w 


lity within columns has not taken ac 
set. We proceed now with the 


column sum of squares, which is 42, is 
hich is 112. Measurement of the variabi- 
count of all of the variability in the entire 
definition and computation of another 


n be eliminated by substituting the 
alues within that column. It will be 


Table 10.3 


VARIABILITY REMA 


INING AFTER ELIMINATION OF 
WITHIN-COLUM 


N VARIABILITY IN TABLE 10.2 
Columns 


column variability can be 
he values in Table 10.3. The 
= 9, the mean of the original values 
à value in the first column of Table 


quares computed from t 
mean of the 15 values in the table is M, 


in Table 10.2. A deviation from M, for 
10.3 is 


(7 — 9). 
The square of this deviation is 


Q = 9 
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Observe that all five squared deviations for values in the first column are the 
same. Therefore, for the first column, the sum of squares of deviations from 
M , can be written as the product, 


s(7 — 9). 

For the second column, the sum of squares of deviations from M, is 
5(8 — 9)*. 

For the third column, it is 
5(12 — 9}. 


Combining these three quantities as a measure of the between-column 
variability yields 


5(7 — 9)? + 5(8 — 9)? + 5(12 — 9)? = 70. 
These numerical expressions, which combine to yield the between-column 
Sum of squares, can be written symbolically, as follows: 


n(Y., — Mj? + n(Ys — Mj? + n(Y5 — Mj 
or 


XY, — Mj? 
R 
or 


n XQ. — Myr. 
k 


If this expression for the between-column sum of squares is expanded and 
simplified, with appropriate substitutions, the result is a convenient computing 
formula: 


OE (10.2) 
n 

Formula (10.2), representing the between-column sum of squares, can be 
translated into verbal instructions, as follows: Find the sum of each column. 
Square each column sum. Sum the squared column sums and divide the result 
by n, the number of values per column. Compute C, = T7/N, where T, is 
the sum of all N values. Subtract C, from the quotient obtained just 
previously. 

The derivation of formula (10.2) is based on the assumption that n, the 
number of values per column, is a constant. When 7 varies, the instructions 
are modified in the following respects: Each squared column sum is divided 
by the number of values in that column. The c quotients which result are 
then summed. Finally, the correction term, C, is subtracted. 

The between-column sum of squares for the numbers in Table 10.2 was 
Computed from the numbers in Table 10.3 and found to be equal to 70. We 
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shall demonstrate the use of formula (10.2) in computing the same quantity. 
The column sums from Table 10.2 are 35, 40, and 60; n is 5; C, is 1,215. 
Substituting these values in formula (10.2), we obtain 
2 + 40? + jo 
EI a = 1.315 270. 


Summary 


Having been given a set of numbers classified by columns, we set out to 
analyze the total variability into two components. We found the total sum 
of squares of deviations from M, to be 112. For each column, we computed 
the sum of squares of deviations from its mean. We then combined the three 
values, 14, 10, and 18, to obtain a measure of the combined within-column 
variability, a sum of squares equal to 42. Finally, we eliminated the within- 
column variability by substituting the mean of a column for each value in that 
column. The remaining variability was called between-column variability. For 
these substituted values, the sum of squares of deviations from M , was found 
to be 70. 

Notice the relation between the total sum of Squares and the two com- 
ponents: the between-column sum of squares and the within-column sum of 
Squares. In the example, 

112 = 70 + 42. 


That is to say, the total variability is equal to the between-column variability 
plus the within-column variability. This relation holds true in general. The 


example given above does not establish the generality of the relation, but it 
can be established. The proof will not be given here. 


On Classifications 


Let C, be the kth category in a set of c categories into which a set of N 
numbers can be classified with m, numbers per category. In the classification 
Scheme of the preceding section, the categories corresponded to columns in the 
rectangular tabulation. A classification involving one set of categories is 
sometimes called a single classification. There is nothing to prevent our 
classifying the same set of N numbers into another set of categories. 

Let R; be the jth category in the second set of r categories. Classifying the 


N numbers simultaneously into both sets of categories will be defined here 
as distributing them into ne 


w categories specified by the symbol combination, 
RCs 
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That is, a number is classified into a new category because it belongs to the 
jt category in the R set and the kth category in the C set. The number of new 
categories is the number of possible combinations of R; and C;, which is 
(r x c). The simultaneous classification can be represented by a rectangle of 
r rows, c columns, and (r x c) cells. The category A;C; is the cell defined by 
the intersection of the jt® row and the K'^ column. It is sometimes convenient 
to refer to such a classification as a double classification or as an (r X c) 
classification. 

Table 10.4 illustrates the principle of doubly classifying numbers. Columns 


Table 10.4 
SIMULTANEOUS CLASSIFICATION ON TWO SETS OF CATEGORIES 


First Set 


Ci C; C; s C; E C, 


Ri 
R, 
Rs 
Second 
Set 


and rows of the table are appropriately labeled. Each cell contains a com- 
bination of symbols indicating the combination of categories represented by 
that cell. Symbols in the cells of Table 10.4 are not to be interpreted as 
numbers. The symbols identify the new categories. The table only serves the 
Purpose of making explicit the way in which numbers can be classified 
simultaneously with respect to two sets of categories. 


Variability in a Double Classification 


When numbers in a set are classified in two different ways simultaneously, 
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as shown in Table 10.5, the variability of the entire set can be analyzed into 
three components. 


Table 10.5 


SCHEME FOR DOUBLY CLASSIFYING NUMBERS 
WITH ONE NUMBER PER CELL 


Columns 


Rows 


way of classifying; r = number 


= à gories in the second way of classifying. 
en (r x c) = number of cells — number of combinations in the simul- 
taneous classification, 


Let n = number of 


The mean of the r Values in the kth 


T, the sum of all N values, can be written as the sum of the c column sums. 
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That is, 


The mean of all N values is 
My, = 
The sum of the c values in the jt” row can be written 
c 
2 Ya. 
k 
The mean of the c values in the jt" row is 


Ifa 
k . 


BUT. 
The sum of all N values was written above as the sum of column sums. It 
can be written as the sum of row sums, as follows: 
roc 
Ty = 22 Vix 
jk 
The total variability exhibited by the V numbers is defined by either of two 
equivalent expressions: 


e r 
È S (Yn — M) 
kJ 

or 
Ff c * 
EEQy.-MS 
7 E 


Thus in defining the total sum of squares it makes no difference whether one 
Sums the r squared deviations in each column and combines these c quantities, 
Or sums the c squared deviations in each row and combines the r resulting 
quantities. One obtains the same result for the total sum of squares by both 
methods. 

Either one of these expressions for the total sum of squares can be 
expanded and simplified. If appropriate substitutions are then made, a 
convenient computing formula results: 


BI Y—, (10.3) 
kj 


or 


where C, = T/N: 
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Example: . . — 
Table 10.6 contains a 3 x 4 classification of numbers with one nu 


Table 10.6 
EXAMPLE OF A DOUBLE CLASSIFICATION OF NUMBERS 


Columns 
1 2 3 4 Row Row 
Sums Means 
1 28 7 
Rows 2 32 8 
3 12 3 


Column Sums 
Column Means 
Sums of Squares 
of Original Values 134 200 101 


83 


cell. Thus r = 3; c 4; and N =r x c = 12. The sums and means of rows 
and columns are arranged on the margins of the table. The sum of squares 
of the values in each column is also given below the column. 

Computation of the total sum of squares of deviations by means of 
formula (10.3) requires the sum of squares of all N original values and the 
quantity, C,. The sum of Squares of the original values is 


134 + 200 + 101 + 83 = 518. 


The other required value, C, — TN, is computed, as follows: 


722 
Cy = 75 = 432, 


Substitution of these values in formula (10.3) yields 


518 — 432 = 86, 


-column variability has 
Shall refer to what remains 


as between-column variab column sum of squares of 


ility. The between- 
deviations is defined by the 


expression, 


$ «Y, — My, 
k 
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Table 10.7 
ELIMINATION OF WITHIN-COLUMN AND 
WITHIN-ROW VARIABILITY IN TABLE 10.6 


A. Results of substituting column means: 


6 8 5 5 
6 8 5 5 
6 8 5 5 


7 7 yd 7 
8 8 8 8 
3 3 3 3 


from which can be derived a convenient formula, 


E Ya 
Rd—— —C (10.4) 


r i 


Formula (10.4) should be compared with formula (10.2). The former is 
Simply an adaptation of the latter. Formula (10.4) can be translated into 
verbal instructions, as follows: Find the sum of each column; square each 
column sum; combine the squared sums and divide by r, the number of rows; 
subtract C, = T2/N from the resulting quotient. 

Example: 

The computation of the between-column sum of squares for the numbers 
in Table 10.6 (or for those in arrangement A of Table 10.7) requires the 
substitution of column sums, a value for C,, and a value for r in formula 
(10.4), as follows: 


182 + 242 + 15? + 15° 
3 


The variability remaining in Table 10.6 after the within-row variability has 
been eliminated can be observed in arrangement B of Table 10.7. We shall 
refer to this remaining variability as between-row variability. The between- 
row sum of squares of deviations is defined by the expression, 


432 = 18. 


EqY,.— My, 
j 
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from which can be derived a convenient computing formula, 


A n 
j a = & (10.5) 

Formula (10.5) should be compared with formulas (10.4) and (10.2). The 
common features of the three formulas should be apparent. Formula (10.5) 
can be interpreted as giving the following instructions: Find each row sum; 
square each row sum; combine these squared sums and divide by c, the 
number of columns; subtract C, = T?/N from the quotient. 

Example: 

To compute the between-row sum of squares for the numbers in Table 10.6 
(or for those in arrangement B of Table 10.7), one substitutes values for the 
row sums, C,, and c in formula (10.5), as follows: 


2 2 2 
Ee S ag = 56, 


With respect to the numbers in Table 10.6, we have established that the 
total sum of squares is 86, the between-column sum of squares is 18, and the 
between-row sum of squares is 56. Observe that the sum of the two com- 
ponents is less than the total. There is some remaining variability, for which 
we have not yet accounted. What remains is called the residual variability. 

The residual sum of squares is defined, as follows: Let rj, be a residual 


deviation for the cell in the Jj row and the kt column. Then the sum of 
squares of the N residuals is 


A residual deviation, r;,, is defined as 


(Yi. — My) — (Y. — M,) — (Y. — M,). 
That is, rj, is what remains of the deviation, (Y; — M ,), after subtraction 
of the deviation, (Y. — M,), and the deviation, ( Y., — M,). 
When the terms in the defining expression are combined, one obtains 


tin = Ya — Y. — Vig + M, (10.6) 
Example: 


The residual deviations for the numbers in Table 10.6 are displayed in 
Table 10.8. The computation of four of the twelve residual deviations is given 


Table 10.8 


RESIDUAL DEVIATIONS FOR 
NUMBERS IN TABLE 10.6 


0 +1 0 —1 
Tl =2 +1 0 
—1 +1 —1 +1 


Summary 197 


below. In the computation, values from Table 10.6 were substituted in 
formula (10.6). The student should verify the computation of the other eight 
residuals. 


moe TH7-6Le= 0 
ry = 9-8 —64+6=+41 
rg = 2-3—-64+6=-1 
rye = 0-7-8 +6=+41 


The residual sum of squares is the sum of squares of the twelve values in 
Table 10.8: 


cr. 
S E r3 = 12. 
kj 


The total variability of the numbers in Table 10.6 has now been analyzed 
into three components. The total sum of squares is 86; the between-column 
sum of squares is 18; the between-row sum of squares is 56; the residual sum 
of squares is 12. Notice that the total is equal to the sum of the three com- 
ponents. That is, 


86 = 18 + 56 + 12. 


The additivity of these three components is an example of a relation whose 
generality will not be proved here. It can be shown that the total sum of 
Squares is equal to the sum of the three components: the between-column 
sum of squares, the between-row sum of squares, and the residual sum of 
squares. The example itself does not, of course, constitute a proof of the 


generality of the relation. 


Summary 


Two ways of analyzing the variability in a rectangular table of numbers 
have been described in the present chapter. The first method of analysis was 
applied to a single classification of numbers. The total sum of squares was 
subdivided into two components: the between-column sum of squares and 
the within-column sum of squares. It was observed that the two components 
were additive, that is, they combined to yield the total. The second method 
of analysis was applied to a double classification of numbers. The total sum 
of squares was subdivided into three components: the between-column sum 
of squares, the between-row sum of squares, and the residual sum of squares. 
The three components combined to give the total. 

Two principles of analysis were stated. The first was that the total variabi- 
lity in a single classification of numbers can be analyzed into two additive 
components: the between-column sum of squares and the within-column sum 
of squares. The second was that the total variability in a double classification 


198 The Analysis of Variability 


of numbers can be analyzed into three additive components: the between- 


column sum of squares, the between-row sum of squares, and the residual 
sum of squares. 


Double Classification and Correlation 


Product-moment correlation was discussed in Chaptei 7 with reference to 
a double classification of numbers, although the tabulation was not so desig- 
nated. Table 7.1 consisted of N rows and two columns with one entry per 
cell. The columns corresponded to two sets of numbers, X and Y. The rows 
corresponded to pairs of numbers, a pair consisting of one number from each 
set. In a table of r rows and two columns, the correlation between the columns 
can be expressed in terms of the between-row variability, the residual 
variability, and the variability within the columns. A connection is thereby 
established between correlation and the analysis of variability for classifica- 
tions of numbers, 

To emphasize the connection between correlation as a measure of con- 
comitance for paired values of Y and Y, and the analysis of variability for 


anr x 2 classification of numbers, we shall employ two different systems of 
symbols for the same quantities, 


Let r,, = the correlation between the columns of an r x 2 classification 


of numbers — zy = the correlation between two sets of paired numbers, 
X and Y. 

Let B — the between-row sum of Squares for an r x 2 classification — the 
between-pair sum of squares for variables Y and Y. 

Let R — 


- the residual sum of squares for an r X 2 classification = the 
residual sum of squares for the N x 2table of X and Y. 
Let W, — the sum of squares for the variability within the first column of 


ther x 2 classification = W, = the sum of squares of deviations for X. 
Let W, — the sum of squares for the variability within the second column 
of the r x 2 classification — W, = the sum of squares of deviations for Y. 
It can be shown that the sum of products of deviations for the arrays of 
either arrangement is equal to 


(B — R)/2, from which it follows that 
B—R 
nh. = — (10.7) 
24 W. 
and that AW, 


oR 10.8) 
24 WW, 2 


Fay, = 


Formula (10.7) states that t 


J la (IC he correlation between the columns of an r x 2 
classification is a function o 


f the between-row sum of squares, the residual 
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sum of squares, and the variability within each column. Formula (10.8) states 
that the correlation between X and Y is a function of the between-pair sum 
of squares, the residual sum of squares for the N x 2 table, and the sums of 
squares for X and Y. Imagine that the W^s are fixed values in both formulas. 
It follows that rs and r,, will increase in algebraic value as B increases and 
R decreases. When 


B>R, 
the correlation is positive; when 
B=R, 
the correlation is zero; when 
B<R, 


the correlation is negative. 


Pragmatics and the Analysis of Variability 


Although methods of analyzing variability into component sums of squares 
can be applied to any table of numbers having the specifications set forth 
early in the present chapter, these methods of analysis are seldom used for 
purely descriptive purposes. They are, however, widely used in statistical 
inference, especially in the analysis of experimental data. Since the descriptive 
uses of these methods are of little importance, no discussion of them will be 
undertaken in this text. The very important role of these methods of analysis 
in statistical inference, in connection with the evaluation of research data, 
will be treated in Chapter 15. At that time it will be necessary to take up the 
computation of variances from the component sums of squares. 

There is one topic in descriptive statistics which merits comment at this 
point. In Chapter 9, the use of product-moment correlation as a measure of 
reliability was presented. In that connection, an allusion was made to an 
interpretation of the coefficient in terms of interindividual variability and 
intraindividual variability. It is this interpretation which deserves additional 
comment. 

In the earlier discussion of Chapter 9, it was stated that the product- 
moment correlation coefficient compared interindividual variability, the 
differences among the average standings of the subjects tested on two occa- 
sions on the same test, with intraindividual variability, the difference in 
standings for each subject on the two occasions.” This comparison is made 
explicit by formula (10.8). B stands for the between-pair sum of squares, 


?An individual's “ standing " was defined as the deviation of his score from the mean 
of the group on a given occasion. 
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which is a measure of differences among average standings. R stands for the 
residual sum of squares, which combines measures of the variability in the 
two standings for each one of the N persons tested. Because the correlation 
can be expressed in terms of the comparison of interindividual variability and 
intraindividual variability, psychologists often refer to a reliable test as one 
which reveals large individual differences, or as a very discriminating test, 
that is, one which discriminates among respondents. 


SAMPLING THEORY 


Syntactics 11 


The most important applications of statistics involve what is known as 
statistical inference. In evaluating research data by the methods of statistical 
inference, one attempts to go beyond the mere description of a particular set 
of data with its unique temporal and spatial context, to take account of error 
in the data and in values computed from the data, and to arrive at proposi- 
tions of some more general and transcendental meaning. These statistical 
inferences from the particular to the general are not always correct, but, under 
certain conditions, they endow the evaluation with a rigor which has not yet 
been, and perhaps never will be, equaled by any other means. 

Underlying the applications of statistical inference is a body of theory 
concerning error. The syntactical features of this body of theory should be 
made explicit before any attempt is made to use the theory. In the present 
Chapter, we shall try to sketch the principal features of a basic part of this 
theory, a part which might be called sampling theory. In doing so, many 
detailed developments which would ordinarily be covered in a course in 
mathematical statistics will be omitted." It is our intention here to attempt to 
convey to the student some understanding and appreciation of the nature of 
the theory of random errors, by means of a very general and quite elementary 
formulation. 

In addition to the underlying theory of random errors, there are certain 
considerations which are important in supporting and guaranteeing the rigor 
of statistical inferences. This additional set of considerations is semantical in 
nature and has to do with (1) the adequacy of measurement, a topic which 
has already been discussed in Chapter 5, and (2) the adequacy of the opera- 
tions by means of which an investigator tries to realize in practice the 
requirements of the theory with respect to the nature of errors. 


'Students who wish to consult a standard, detailed text in mathematical statistics might 
examine R. V. Hogg and A. T. Craig, Introduction to Mathematical Statistics (New York: 
Macmillan, 1959). 
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Chapter 12 will present in summary form some of the major Sent 
statistics with definitions and brief descriptions of their logical properties wi 
respect to error. In Chapter 13, the semantical issues of statistical inference 

i discussed. —- 
ba begin our discussion of sampling theory with a frequency distribu- 
tion for variable X. We shall refer to this frequency distribution as a popula- 
tion or parent distribution because we plan to form samples out of values 
chosen from the distribution. Let Y; be the value of X in the jth score interval 
of the population. Let f; be the frequency of occurrence of X; in the popula- 
tion. Further, let r be the number of distinct and equal score intervals in the 


distribution. Finally, let N equal 2f;, the total number of values in the 
distribution. The distribution is shown in Table 11.1. 


Table 11.1 
A POPULATION DISTRIBUTION 
Xx J 
x fi 
x, 2 
Xs fa 
X; fi 
X, f. 


r = number of distinct score intervals. N = Zf; = total number of values. 


the application of this s 
forming combinations, 
Student, makes possibl 


restriction on the repe: 
to form several combinations of two values, any 
Table 11.2 is allowed by the System. We shall say that giving the particular 


n and the order of their selection identifies a way of 


n. Thus the pairs listed in Table 11.2 are examples of 
different ways of choosing combinations of two values. 

It is worth noting that all Possible differe 
defined by the (r x c) 


* For example, if we desire 
one of the pairs of values in 
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r — c and the matrix is square. In interpreting the matrix, we let the inter- 
section of the jt" row and the K!^ column represent a way of choosing two 


Table 11.2 


EXAMPLES OF PAIRS OF VALUES ALLOWED BY THE SYSTEM 
FOR SELECTING FROM THE POPULATION IN TABLE 11.1 


First Choice Second Choice 
Pf Xa 
X. x 
Pairs or x Xa 
Ways of Xa Xx 
Choosing Xs x; 
Xx Pf 
X; Xs 
X, Xs 


values, Y; and Y;. In the diagonal cells running from the upper left corner 
to the lower right corner are combinations for which j = k and X; = Xj. 


Table 11.3 


ALL POSSIBLE DIFFERENT WAYS OF CHOOSING TWO VALUES 
FROM THE POPULATION DISTRIBUTION IN TABLE 11.1 


Second Choices 
x, X: Xa : Xj , X, 


First 


Choices 


Note: In this instance, r = c and the matrix is actually square. 


The matrix is symmetrical around the diagonal in the sense that the com- 
bination for the second row and the third column involves the same values 
as the combination for the third row and the second column, but chosen in 
a different order. To represent all possible different ways of choosing com- 
binations of three values, a figure of three dimensions, that is, a cube, would 
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be required. For combinations of n values, an n-dimensional arrangement 
would be needed. Á 

The cells of the square in Table 11.3 define all possible different ways of 
choosing combinations of two values. A way of choosing two values is 
identified by listing the values in the order of their selection. When there are 
r distinct values in the population, the number of different ways of choosing 
combinations of two is r?, 

There is an important distinction which should now be made. The dis- 
tinction involves the number of ways of choosing and the total number of 
possible combinations. The total number of possible combinations of two 
values is N?, a conclusion reached by observing that there are N possible 
first choices and, for each of these, N possible second choices. 

To determine all possible combinations of two values, we must take account 
of the frequency in each interval of the scale of X. Consider the cell in the 
second row and third column of Table 11.3. It represents the combination, 
Xa chosen first and X, chosen second. We have said that X, occurs with a 
frequency of f, and ¥, occurs with a frequency of f, in the population. The 
number of possible combinations of Y, chosen first and X, chosen second is 
(fs X fa). The reasoning which supports this conclusion is simple. There are 


fz possibilities in choosing X, first; for each of these, there are f, possibilities 


in choosing Y, second. It follows that the total number of these combinations 
is (fa X fa). 


Table 11.4 gives the n 


) umber of possible combinations for each way of 
choosing two values. 


Table 11.4 


NUMBER OF POSSIBLE COMBINATIONS PER WAY 
OF CHOOSING TWO VALUES 


Second Choices 


f 
. fs 
First 
Choices 
"n 
f, 


Note: In this instance, r — c and the matrix is Square, 
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To summarize, the number of different ways of choosing two values is r?; 
the number of possible combinations of two for any way of choosing is 
fifi; the total number of possible combinations of two is N?. In general, the 
number of different ways of choosing combinations of n values is r”; the 
number of possible combinations for any way of choosing is the product of 
n marginal frequencies; and the total number of possible combinations is N”. 

The frequencies in the cells and on the margins of Table 11.4 are often 
expressed as relative frequencies. In the population and in each of the 
corresponding marginal distributions, p;, the relative frequency for the jt 
interval, is defined as 

á 


Pi= N 
and p,, the relative frequency for the kt” interval, is defined as 


=h. 
P= 


In the square matrix of Table 11.4, the relative frequency for the cell in the 
J™ row and the kt® column is 
fe fife 
Pik = A = N - = pipi. 
Observe that the relative frequency for a cell is equal to the product of the 
two relative frequencies on the margins. Observe also that 


2 pi =1, 
j 
Pip =1, 
k 


and 
222 jn. =1. 
ki 


In further discussions of the population distribution, the corresponding 
marginal distributions, and the distribution of combinations, we shall employ 


relative frequencies. 


Distribution of the Sample Mean 


Any combination of n values chosen from the population distribution in 
Table 11.1 according to the strictly logical system outlined above is, by 
definition, a random sample. Any value computed from the values in a 
random sample is an /-statistic. It is of very great importance that a frequency 
distribution can be deduced which will specify the values taken by a given 
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i-statistic for all possible random samples of a particular size. The frequency 
distribution of values of a given i-statistic is called the sampling distribution 
of that i-statistic. Of course, many different i-statistics can be computed from 
the values in a sample. For demonstration purposes, the sample mean will do 
quite well and we shall use it to show the deductive process by means of 
which a sampling distribution can be established. 

Let us again work with samples of two values. For any sample, the sum of 
the two values in that sample will be 

X; +X, 

and the sample mean will be 


M, En s 
2 
Since M, is computed from the v; 


alues in a random sample, it is an inferential 
statistic or an i-statistic, 


: d in a new distribution showing the 
relation between relative frequency and values of M,. 


An example may help in clarifying 


„exam the procedure of establishing the 
sampling distribution of M,. Table 11 


-5 contains a simple, three-interval 


Table 11.5 
AN EXAMPLE OF A POPULATION 
x p 
0 0.50 
1 0.20 
2 0.30 


In the example, r = 3 and r? — 9. The nine different ways of choosing 


fied by the entries in the nine cells 
On the left in each cell: the second 
frequency for each cell is the pro- 
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Table 11.6 


WAYS OF CHOOSING TWO VALUES FROM 
THE POPULATION OF TABLE 11.5 


In each cell, the first choice is on the left, the second is on the right. 


duct of the marginal relative frequencies. Relative frequencies for the margins 
and the cells are displayed in Table 11.7. The value of M, for the combination 


Table 11.7 
RELATIVE FREQUENCIES FOR WAYS 
OF CHOOSING TWO VALUES 


0 1 2 xX 
50 20 30 Pr 


X; Pi 


in each cell has been computed and is shown in arrangement A of Table 11.8. 
In arrangement B of the same table, cells containing the same value of M, 


Table 11.8 
MEANS OF SAMPLES OF TWO VALUES 
A 
Xy 
0 1 2 
0 
Wd 
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have been assigned a common label from the letters: A, B, C, D, and E. The 
combined relative frequency for each different value of M, is the sum of the 
relative frequencies in cells having a common label. If we let the labeling 
letters be variables representing relative frequencies, then for M, = 0, the 
relative frequency is 4 = .25; for M, = .5, the combined relative frequency 
isZB = .10 + .10 = .20; for M, = 1, the combined relative frequency is 
XC = .15 + .04 + .15 = .34; for M, = 1.5, the combined relative frequency 


is ZD = .06 + .06 = .12; and finally, for M, = 2, the relative frequency is 
E — .09. 


If we let P be the relative frequency of any value of Ma, then the sampling 
distribution of M, for samples of two values is the relative frequency dis- 
tribution of Table 11.9. The sampling distribution of M, gives the distribution 
of all possible values of M, for samples of a specified size. 


Table 11.9 
AN EXAMPLE OF A SAMPLING DISTRIBUTION 
M, P 
0 25 
5 .20 
1. 34 
1.5 2: 
2. .09 


M, computed from samples of two values from the population in Table 11.5. 


The reasoning which entered into the determination of the distribution of 
w given in Table 11.9, isa very simple example of deduction in sampling 
theory. Although the extensions and elaborations of deduction in sampling 
theory go far beyond this simple example and sometimes involve advanced 
and difficult mathematics, all theoretically deduced sampling distributions 


serve basically th pling distribution of any i-statistic 


€ same purpose. The sam 
is simply a specification of the relative frequencies of the totality of possible 


values for the particular i-statistic. 


The i-statistic in the example given above was a mean computed from a 
single sample. There are, of course, many other i-statistics. Sampling dis- 
tributions have been established for values, other than the mean, computed 
from a single sample, for values computed from a single sample from a bi- 


Distribution of the Sample Mean 209 


variate or multivariate distribution, for values computed from two or more 
samples, and for values computed from samples classified in a great variety 
of ways. 

The deductive procedures employed in obtaining the sampling distribution 
in Table 11.9 are applicable to discrete population distributions. Discrete 
distributions are those having a finite number of score intervals. The deductive 
procedures of advanced work in sampling theory differ from those of the 
example in three respects. First, the population distribution is continuous; 
that is, r, the number of score intervals, is indefinitely large. Second, the 
relative frequencies of the population distribution are specified by an equation 
for a continuous function. Third, the relative frequencies of the sampling 
distribution are specified by an equation for a continuous function. 

The general principle we have been trying to communicate is summarized 
in the following conceptual formula: A specified population distribution plus 
a standard deductive procedure yields the sampling distribution of an 
i-statistic. 

None of the deductive procedures of advanced work in sampling theory 
will be given here because these topics are appropriate to courses in mathe- 
matical statistics. What are important to psychological statistics and to the 
use of statistics in psychological research are the results of the deductions, the 
sampling distributions. In Chapter 12, the sampling distributions of a number 
of the more important i-statistics will be identified and the general nature of 
the information we need about these sampling distributions will be described. 


Exercise 11.1 
Given in Table 11.10 is a discrete population distribution. Obtain the sampling 


Table 11.10 
A POPULATION DISTRIBUTION 
X p 
0 20 
1 50 
2 .20 
3 10 


distribution for the mean of random samples of two values. What steps would be 
necessary to extend the procedure to samples of three values? 


SAMPLING 
DISTRIBUTIONS 
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In deducing the form of a sampling distribution, one must begin with a 
specification of the population distribution. A vast part of sampling theory 
is concerned with the drawing of random samples from a population dis- 
tribution in which the relative frequency of X is specified by a normal 
probability function or normal curve. Although there are many other 
functions or curves which could serve as population distributions, the family 
of normal curves has occupied a central position in statistical theory. Because 


of its central position, a number of i-statistics deriving from normal curve 
theory will be discussed in this chapter. 


The equation for a normal curve is 


1 z fer) 
= — 20 
= 
P SN 


x 


Fig. 12.1 A normal probabilit 


r y function or normal 
represent a population distributio 


curve. It is frequently employed to 
n in statistical inferen 


ce. (The variance is o2) 
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where p is the relative frequency expressed as a function of X; p is the mean 
of the population; c? is the variance of the population; e is a constant value 
of approximately 2.71828; and ~ is a constant value of approximately 3.14159. 

A normal curve is symmetrical and unimodal. Its branches approach, but 
do not reach, the horizontal axis no matter how far they are extended. S, the 
index of skewness, is 0 and K, the index of kurtosis, is 3. An example of a 
normal curve is given in Figure 12.1. 

Imagine now that the function in Figure 12.1 is a population frequency 
distribution. Of interest to us here is the problem of forming all possible 
random samples, of some specified size, from this normally distributed popu- 
lation and determining the distribution of all possible values of some specified 
i-statistic computed from each random sample. The theoretical process of 
determining the sampling distribution of a specified i-statistic is one of 
mathematical deduction beginning with the equation for a normal curve and 
ending with another equation for the sampling distribution. Some of these 
sampling distributions are primarily of theoretical interest and importance; 
Others are of great practical importance in the evaluation of research data. 


Sampling Distributions of Theoretical Interest 


THE MEAN OF A SAMPLE 


M,, the mean of a random sample of values, is itself normally distributed 
with mean equal to p, the population mean, and variance equal to o?/n, where 
o? is the population variance. A normal curve representing a sampling dis- 
tribution for M, is shown in Figure 12.2. That is to say, when the population 


A 
E 


Mx 


Fig. 12.2 A normal curve representing the sampling distribution of values of M; 
computed from all possible random samples from a normally distributed population. 
(The variance is o2/n.) 
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is normally distributed, the totality of values for means of random samples of 
some fixed size is also normally distributed. 


THE VARIANCE OF A SAMPLE 


V, the variance of a random sample of n values, is seldom used as an 
i-statistic and its sampling distribution receives little attention. In place of 
Va the i-statistic, V4, is employed. V is defined as 


UE e: à 
= a=] 


Thus V; is the sample variance computed by dividing the sum of squares of 


deviations by (n — 1) instead of n. The divisor, (n — 1), is usually called the 
number of degrees of freedom. V. is sometimes referred to as the corrected 
value of the sample variance. The reason is that 


n 
Ld 
K = 


= V 
Fo" 


z. 


Some additional implications of this correction will be discussed in connection 
with the estimation of population values. 

Vj, the sample variance computed with degrees of freedom, has a known 
sampling distribution whose equation will not be given here. That is to say, 
when the population distribution is normal, all possible values of V/ for 


2 
0 o v, 
a E 12.3 The sampling distribution of Vz when n — 2. (The mean of the distribution 


= 
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samples of some specified size from that population form a distribution exactly 
fitted by a known mathematical function. The mean of the sampling distribu- 
tion of V. is o?, the population variance. Figure 12.3 shows a distribution of 
V! whenn = 2. It is worth noting that a closely related variable, (n — DV ie?, 
is distributed as chi square. That is, it has a sampling distribution specified 
by one of a family of functions called chi square functions. A particular chi 
Square distribution is identified by its mean. The mean of a chi square distri- 
bution is equal to (n — 1) the number of degrees of freedom employed in 


computing V;. 


THE DIFFERENCE BETWEEN TWO SAMPLE MEANS 


Given all possible combinations of two random samples, each sample 
consisting of n values from a normally distributed population, D,, the 
difference between the means of two samples, is normally distributed. The 
mean of the sampling distribution is zero and the variance is equal to 26?/n. 


A sampling distribution of D, is displayed in Figure 12.4. 


0 
Dx 


Fig. 12.4 A normal curve representing the sampling distribution of D,, the difference 


between means of two random samples of 7 values. The mean of the sampling distribution 
is zero. (The variance is 207/n.) 


BETWEEN-SAMPLE AND WITHIN-SAMPLE VARIANCES 


In Chapter 10, it was shown that the total variability of numbers classified 
into subsets could be analyzed into additive components. Sampling provides 
One basis for classification. Numbers selected from a population distribution 
can be classified according to the sample in which they occur. For example, 
if we were given c samples of n values from a normally distributed population, 
We could classify the (cn) values into c subsets or columns corresponding to 
thec samples. The total sum of squares for the N — cn values could then be 
analyzed into two components: a between-column or between-sample sum of 
Squares and a within-column or within-sample sum of squares. Variances can 
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be computed from these two components. V ;, the between-sample variance, 
is computed by dividing the between-sample sum of squares by a number of 
degrees of freedom equal to (c — 1). In the case of two samples, of course, 


3 df 
2 df 
P 
P 
| c! 
" 
0 ci 
v 
20 df 
5 df P 
P 
Se 
v 


v 
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distribution has a mean equal to RU dom used in computing the variance. Each 
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(c — 1) = 1. Vi, the combined within-sample variance, is computed by 
dividing the combined within-sample sum of squares by a number of degrees 
of freedom equal to (N — c). In the case of two samples, (N — c) — (N — 2). 

Vi, the between-sample variance, has a known sampling distribution, 
whose equation will not be given here. The mean of the sampling distribution 
is o2, V}, the within-sample variance, also has a known sampling distribution. 
Again the mean of the sampling distribution is o. Figure 12.5 shows functions 
which can be viewed as representing distributions of either V ; or V, depend- 
ing on the number of degrees of freedom employed in computing the 
i-statistic. For example, the function labeled with two degrees of freedom 
could be the sampling distribution of V, for three samples or of V „ for two 
samples of two values each. Notice that the number of degrees of freedom 
employed in computing V; for three samples is two and that the number of 
degrees of freedom employed in computing V for two samples of two values 
each is two. The function labeled with three degrees of freedom could be the 
sampling distribution of V ; for four samples or for V ;, for two samples, one 
of three values, the other of two. The function labeled with five degrees of 
freedom could be the sampling distribution of V for six samples or of Vy, 
for samples of three and four values. The function labeled with twenty degrees 
of freedom could be the sampling distribution of Vj, for four samples of six 
values each or for five samples of five values each. The student should figure 
out other combinations of c and 7 for which the sampling distributions of 


V or V/, would be represented by these functions. 
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THE DISTRIBUTION OF t 


There is an important i-statistic which is designated f. A function which 
Specifies the sampling distribution of t is called a ¢ function. There is a family 
of t functions, one for each particular number of degrees of freedom involved 
in the computation of t. Figure 12.6 shows ¢ functions for 1, 2, 5, and 30 
degrees of freedom. A t function is symmetrical and unimodal. Its branches 
approach but never reach the horizontal axis no matter how far they are 
extended. Its mean is zero. Its variance is f/(f — 2) if f > 2, where f is the 
number of degrees of freedom identifying the function. Its index of skewness 
is zero, Its index of kurtosis is greater than three. As the number of degrees 
of freedom increases, the distribution of t approaches the normal form. 

Values of ¢ can be computed from a variety of other i-statistics. Three cases 
are given below. 
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1. For a single random sample of n values from a normally distributed 

population, 
M, p. 


E (12.1) 
A Vin 


Formula (12.1) can be interpreted as saying that the ratio on the right is 
distributed as t. The ratio compares the difference between M,, the sample 
mean, and p, the population mean, with a value computed from V}, the 
sample variance computed with (n — 1) degrees of freedom. The sampling 
distribution of t computed by formula (12.1) is given, in Figure 12.6, by the 
t function for one degree of freedom when n — 2, by the t function for two 
degrees of freedom when n = 3, by the ż function for five degrees of freedom 
when n = 6, and by the ż function for thirty degrees of freedom when 1 —31. 
2. For two random samples from a normally distributed population 
distribution of X, 
D. 
B A2 Vin 


t (12.2) 


where D, is.the difference between two sample means, V, is the combined 
within-sample variance computed with 2(n — 1) degrees of freedom, and n is 
the number of values per sample. Formula (12.2) can be interpreted as saying 
that the ratio on the right is distributed as t. The ratio compares the difference 
between two sample means with a value computed from the combined 
within-sample variance. The number of degrees of freedom is 2(n — 1). The 
sampling distribution of computed by formula (12.2) is given in. Figure 12.6 
by the z function for two degrees of freedom when n = 2, and by the t 
function for thirty degrees of freedom when n — 16. 

3. For a single sample of paired values from a bivariate population 
distribution of X and Y, 


mE (12.3) 
In formula (12.3), rey is the product-moment correlation coefficient computed 
from a sample of N pairs of values. In this case, the sampling distribution of 
t contains values which could be computed from the totality of sample correla- 
tion coefficients based on all possible pairings of values of Y with a given set 
of N values of Y. The deductions which establish that the ratio in formula 
(12.3) is distributed as ¢ require that the Y's for any given X are normally 
distributed and that p, the population correlation, is zero. The number of 
degrees of freedom is (N — 2). The sampling distribution of ¢ computed by 
formula (12.3) is given in Fig. 12.6 by the ! function for two degrees of 
freedom when N = 4, and by the : function for thirty degrees of freedom 
when N = 32. 
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Another important i-statistic, designated F, is the variance ratio. A 
function which describes the sampling distribution of F is called an F function. 
There is a family of F functions. A particular curve from the family is 
identifiedby a combination of degrees of freedom. The combination consists 
ofthe two numbers of degrees of freedom which enter into the computation of 
the two variances in the ratio. Figure 12.7 contains six F functions. Each of 


the six functions is labeled with two numbers giving the combination of 
degrees of freedom for that function. 


1 and 2 dí 1 and 30 df 
P 
P 
ò i E 
1 F 


ie) 


2 and 4 df 
2 and 20 df 
j P 
Q0 i " i 
F 
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5 and 5 df 20 and 20 df 


o i F Ó 1 F 


Fig. 12.7 Functions for the sampling distribution of F for six combinations of degrees 
of freedom. 


F is a ratio involving two variances. The ratio can be computed in a variety 
of ways, three of which are listed below. 

1. For c = 2 random samples from a normal population distribution, 

YS 12.4 
r= (12.4) 

Let n = the number of values per sample; let N = cn. Vj, is the between- 
sample variance computed by dividing the between-sample sum of squares 
by (c — 1) degrees of freedom. V is the combined within-sample variance 
computed by dividing the combined within-sample sum of squares by (N — c) 
degrees of freedom. F is the ratio of the between-sample variance to the 
Within-sample variance. The sampling distribution of F computed by formula 
(12.4) is given in Figure 12.7 by the F function for one and two degrees of 
freedom when c — 2 and n — 2, and by the F function for one and thirty 
degrees of freedom when c — 2and n — 16. It is interesting to note that, when 
€ — 2, F computed by formula (12.4) is equal to t? when t is computed by 
formula (12.2). 

2. For two random samples from a normally distributed population, 


five Ets (12.5) 


Let n, = the number of values in a first sample; let ną = the number of values 
in a second sample. Again F is the ratio of two variances. In the numerator 
is the variance for a first sample computed by dividing the sample sum of 
Squares by (n, — 1) degrees of freedom. In the denominator is the variance 
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for a second sample computed by dividing the sample sum of squares by 
(na — 1) degrees of freedom. The sampling distribution of F computed by 
formula (12.5) is given in Figure 12.7 by the F function for two and four 
degrees of freedom when n, = 3 and n, = 5, by the F function for five and 
five degrees of freedom when n, = n, = 6, and by the F function for twenty 
and twenty degrees of freedom when n, = n, = 21. 


3. For a single sample of paired values from a bivariate population 
distribution of X and Y, 


p LUN -2, (12.6) 


1 — ri, 


which is equivalent to a ratio involving two variances, 


where V, is the variance of predicted values and Vi.y is the variance of 
errors of prediction. Vj, is computed by dividing Zy”, the sum of squares 
of predicted values, by one degree of freedom. V/, ,, is computed by dividing 
X(y — y'Y, the sum of squares of errors of prediction, by (N — 2) degrees of 
freedom. In formula (12.6), r», is the product-moment correlation coefficient 
computed from a sample of N pairs of values. In this case, the sampling 
distribution of F contains values which could be computed from the totality 
of sample correlation coefficients resulting from all possible pairings of 
values of Y with a given set of values of X. The deductions which establish 
that the ratio in formula (12.6) is distributed as F require that the Y's for any 
given X are normally distributed and that p, the population correlation, is 
zero. The student should compare formula (12.3) and formula (12.6). F com- 
puted by formula (12.6) is equal to when t is computed by formula (12.3). 
The numbers of degrees of freedom for F are 1 and (N — 2). The sampling 
distribution of F computed by formula (12.6) is given in Figure 12.7 by the 
F function for one and two degrees of freedom when N = 4, and by the F 
function for one and thirty degrees of freedom when N — 32. 


THE DISTRIBUTION OF ô 


The standard normal deviate, an i-statistic which we shall designate as 8, 
will be discussed next. The standard normal deviate, 8, is a normally dis- 


tributed variable. Its distribution has a mean of zero, a variance equal to one, 
an index of skewness equal to zero, and an index of kurtosis equal to three. 
Only one way of computing 8 will be given here and that way is only 


approximate, not exact. For two large, random samples from a bivariate 
population distribution, the ratio 


ACA 
42KN — 3) 
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is distributed approximately as the standard normal deviate. Therefore, we 
write 

Zi-—Zi 
~ JBKN —3) 
understanding that the equality holds only approximately. Zi 18 computed 
from the product-moment correlation coefficient for the first sample, as 
follows: 


8 (12.7) 


D tre. 


eS 


i ud 
s = 5lo& 


Z, is computed in the same way from the product-moment correlation 
coefficient for the second sample. N is the number of pairs of values in each 
sample. Figure 12.8 shows a normal curve with zero mean and unit variance, 


0 
ô 


Fig. 12.8 The sampling distribution of the standard normal deviate, 6. The function is a 
normal probability function or normal curve with zero mean and unit variance. 


the function for the sampling distribution of ô. Notice that formula (12.7) 
involves the difference between Z; and Z;. Since Z| is a function of the 
magnitude of one sample correlation coefficient and Z; is a function of the 
magnitude of the second sample correlation coefficient, à indirectly compares 
the two sample correlation coefficients. In deducing the sampling distribution 
of 8, there is no requirement that p, the population correlation coefficient, be 
Zero, as there was for t by formula (12.3) and F by formula (12.6). 


The Tabled Functions 


The distribution functions of t, F, and 8 have been studied and information 
about them has been tabulated. In general terms, the tabulated information 
for a particular function consists of one or more values of the i-statistic, each 
of which is exceeded by a known proportion of all values in the sampling 
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distribution. We shall comment on the tabled functions in the order: t, F, 
and 8. i 

Table A in the Appendix presents information for 34 different t functions, 
each one identified by a number of degrees of freedom. For each of these 
t functions, a value of t is given. Let us call this particular value ż,. It has two 
interpretations: one as a positive value, the other as a negative value. Inter- 
preted as a positive value it is known to be exceeded by 0.025 of the totality 
of t values in that sampling distribution. Interpreted as a negative value, it is 
known to exceed 0.025 of the totality of ¢ values in the distribution. Thus the 
proportion of values of t exceeding the positive value of t, plus the proportion 
of values exceeded by the negative value of t, is equal to 0.05. 

Examination of Table A reveals that, for two degrees of freedom, a value 
of t, equal to +-4.303 is exceeded by 0.025 of all values of t in that distribution 
and a value of t, equal to —4,303 exceeds 0.025 of all values in the distribu- 
tion. For thirty degrees of. freedom, a value of t, equal to 4-2.042 is exceeded 


by 0.025 of all values of t and a value of t, equal to —2.042 exceeds the same 
proportion of values. 


Interpreted with respect to the sampling distribution of 1, the tabled values 
of t, provide a basis for classifying all values of t in the distribution. We shall 
return later to this notion of classifying values of an i-statistic. It has great 
importance in the pragmatics of statistical inference. 

Interpreted graphically, 
which define areas under t 
the curve and the horizon 


the tabled values locate points on the scale of ! 
he curve. Consider the total area enclosed between 
tal axis. If we erect perpendiculars to the horizontal 


2 df 
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axis at the points defined by positive and negative values of te the area to the 
right of the positive value is 0.025 of the total area under the curve and the 
area to the left of the negative value is 0.025 of the total area. Together these 
two tail areas make up 0.05 of the total area. Figure 12.9 shows the f functions 
for two and thirty degrees of freedom. Two perpendiculars have been erected 
for each curve. They were erected at --4.303 and —4.303 on the axis of the 
function for two degrees of freedom, and at +2.042 and —2.042 on the axis 
of the function for thirty degrees of freedom. The tail areas have been shaded. 

Table B, in the Appendix gives information about 168 different F functions, 
each one identified by two numbers of degrees of freedom. For each of these 
F functions, a value of F is given. Let us call this tabled value F.. It is inter- 
preted only as a positive value. It is known to be exceeded by 0.05 of the 
totality of F values in the sampling distribution. 

Examination of Table B, reveals that, for one degree of freedom in the 
numerator and four degrees of freedom in the denominator, a value of Fa, 
equal to 7.71 is exceeded by 0.05 of all values of F in that distribution. For 
five degrees of freedom in the numerator and five degrees of freedom in the 
denominator, a value of F, equal to 5.05 is exceeded by 0.05 of the values of 
F in the sampling distribution. (Table B, gives the value of F, exceeded by 
0.025 of values in the distribution.) 

_ The tabled values of F, provide a basis for classifying the totality of values 
in each sampling distribution. We shall have more to say about this classifying 
in a later discussion of pragmatics. Interpreted graphically, the tabled values 


F 


Fig. 12.10 An F function for five and five degrees of freedom. A perpendicular has been 
erected at a point on the horizontal axis such that the shaded tail area is 0.05 of the total 


area under the curve. 
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i the scale of F. At each point a perpendicular can be erected 
ree mente en the curve. Figure 12.10 shows an F function for five vn 
five degrees of freedom. A perpendicular has been erected at the ees on 
horizontal axis, 5.05. The shaded tail area is 0.05 of the total area unde 
Ni have pointed out that there is a family of t functions and a E of 
F functions. Table A provides information about 34 1 functions selected rom 
the entire family; Table B, provides information about 168 F functions 
selected from the family; Table B, gives information about 100 F ie dep 
When the number of degrees of freedom is indefinitely large, t is equal to ô, 
the standard normal deviate. Therefore, the value of 8, can be read from the 
last line of Table A. The value of 8, has two interpretations: one as a positive 
value, the other as a negative value. Interpreted positively, the value of 5, 
exceeded by 0.025 of the values in the sampling distribution is 1.96. Inter- 
preted negatively, the value of 8, which exceeds 0.025 of the values in the 
sampling distribution is —1.96. Figure 12.11 contains the normal curve in 


0 
ò 
Fig. 12.11 A normal curve with zeri 


© mean and unit variance representing the sampling 
distribution of à. Perpendiculars have 


been erected at 8 = 1.96 and 8 = — 1.96. Each 
shaded area is 0.025 of the total area under the curve. 


Standard form representing the sampling distribution of 8. Perpendiculars 
have been erected at 8, — 1.96 and 8, — —1.96. Each shaded area constitutes 
0.025 of the total area under the curve. 


Some statistical texts present very extensive tables of the standard normal 
deviate, but not for the same 


purposes served by the tables of t and F. Since 

our interest in 8 is the same a: 

information which is necessar 
Tables of t and 


and 0.025 for ext 


Teme positive values. In the distrib 
frequency 0.05 st: 


ution of F, the relative 
ands for extreme values in the positi 


ve direction only. Tables 
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of t and F are available, which are more extensive than those given in the 
Appendix. These tables can be more extensive in two ways. They may exhibit 
greater variety in the numbers and combinations of degrees of freedom 
employed. They may present relative frequencies other than and in addition 
to 0.05. Values of an i-statistic are sometimes given for relative frequencies 
such as 0.10, 0.02, and 0.01 in addition to the values for the relative frequency 
0.05. The limited tables presented in the Appendix have been found to be 
quite satisfactory for instructional purposes. The research psychologist would 
do well to have available the most complete tables. Two sources containing 
extensive tables are: R. A. Fisher, and F. Yates, Statistical Tables for Bio- 
logical, Medical, and Agricultural Research (London and Edinburgh: Oliver 
and Boyd, 1948); and G. W. Snedecor, Statistical Methods (Ames, lowa: 
lowa State College Press, 1946). 


From Theory to Practice 


Up to this point, the discussion has dealt only with the syntactical features 
of sampling error. The problem of realizing these features in practice is a 
Problem of semantics, which remains to be considered. Practical random 
sampling, as distinguished from theoretical random sampling, will be 
considered in Chapter 13, which follows. 


PRACTICAL RANDOM SAMPLING 


Semantics 18 


There are two major semantical problems in psychological statistics. One 
of these, the problem of confirmation in psychological measurement, was 
discussed in Chapter 5. The other problem has to do with confirmation of the 
correspondence between random sampling theory and random sampling 
practice. The problem of confirmation in sampling theory and practice is the 
source of much misunderstanding, confusion, and incorrect usage in psycho- 
logical research. 

There are two principal ways in which the researcher can go wrong with 
Fespect to the problem of correspondence between sampling theory and 
practice. First, he may fail to recognize that the problem has to do not only 
with what he thinks, but also with what he does. For him, correspondence 
between theory and practice, or the lack of correspondence, is not a reality. 
He may look upon correspondence as nothing more than a conceptualization. 
He may adopt random sampling theory as no more than a sophisticated view- 
point, currently in vogue in the best research circles. If he does accept the 
reality of Correspondence, he may err in thinking that its occurrence is 
repens He does not fully appreciate the fact that, within certain limits, 
he can actually make practice correspond to theory, and that his failure to do 
So leaves his work vulnerable to criticism and subject to rejection. He sees no 


of research operations and the statistical theory 
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his choice of a method of analysis with the character of his data. Critical 
features of his research operations may vary from one project to another, but 
this variation is not reflected in the kind of analysis he employs. The choice 
between description and inference is not determined by the nature of the 
Operations which produce his data. It is often determined by nothing more 
than a preference for a sophisticated method over an unsophisticated one. 

With respect to the correspondence between theory and practice, there are 
no standards, in an absolute sense, but there are standards in a relative sense. 
That is to say, correspondence is obviously better for some research opera- 
tions and some evaluations than for others. It is sometimes possible to be 
discriminating in the choice of operations and thereby improve corres- 
Pondence. When that is not possible, one can still be discriminating in his 
choice of a method of analysis. The discriminating researcher recognizes 
variation in the quality of research operations and is prepared to choose a 
type of evaluation which reflects this judgment and which is not likely to 
mislead his audience. Description and inference are the two kinds of analysis, 
between which a researcher can choose and thereby coordinate method of 
analysis and quality of operation. 

The second way in which the resea à 
misunderstanding of the consequences of various kinds of failure in corres- 


Pondence between theory and practice. There is no way of assessing over-all 
Correspondence. Practice may correspond to theory in one respect but not 
another. Furthermore, the degree of correspondence may vary. There is 
available substantial evidence as to the consequences of certain kinds and 
degrees of failure in correspondence. The consequences of one kind are known 
to be negligible. The consequences of another may be disastrous. All of this 
means that the psychologist must put the issues in a proper order; that is, he 
must put the more serious issues ahead of the less serious ones. He should, 
of course, try to satisfy all requirements perfectly, but if he falls short of the 
ideal, he should not become preoccupied with a trivial failure and ignore a 
Serious one, 


rcher can go wrong relates to his 


Dimensions of Correspondence 


We said above that there is no way of assessing over-all correspondence 
between random sampling theory and practice. Theoretical random sampling, 
as it was presented in Chapter 11, involved a known population distribution 
and a prescribed deductive system for obtaining the sampling distribution of 
Some i-statistic. In Chapter 12, the population distribution was specified as a 
Normal curve and the deduced sampling distributions were the ¢ function, the 
F function, and the normal curve representing the standard normal deviate. 
(See formulas 12.1 to 12.7.) For convenience in discussion, let us divide the 
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problem of correspondence into two parts: the characteristics of the popula- 
tion distribution and the system of obtaining samples. 

A normal curve is a continuous function, unimodal and symmetrical. (See 
Figure 12.1.) Its index of skewness is zero. Its index of kurtosis is three. 
Suppose the population distribution is not a normal curve. Suppose there is, 


as we say, some departure from normality. What are the consequences? Are 
they serious? 


Psychological measurements are always discrete variables, not continuous 
ones. The degree of discontinuity in the population may be great, as when the 
number of score intervals is two, or it may be small, as when the number 
of score intervals is one hundred. Whether the population discontinuity is 
great or small, the effect on a sampling distribution depends on the size of 
the sample. For any given number of score intervals, as sample size increases, 
the discontinuity of a sampling distribution diminishes. Furthermore, the 
effect will vary with the particular i-statistic under consideration. There is, 
of course, no absolute standard for deciding whether an effect is negligible or 
serious. Having surveyed the available evidence, we take the position here 
that the effect of discontinuity is negligible (1) when the number of values per 
sample is equal to or greater than twenty and the number of score intervals 
E equal to or greater than five; and (2) when the number of values per sample 
Is equal to or greater than five and the number of score intervals is equal to 


T greater than twenty. The choice of these values is, to some extent, arbitrary- 
hey are given simply as guides for the student who might be expected to 
form his own standards eventual 


There seve a ly after he accumulates more experience. 
i I5 evidence as to the specific effects of discontinuity on the sampling 
distributions of a Variety of i-statistics, but presentation of that evidence 
would take us beyond the reasonable scope of this discussion. 
: with respect t e of the 
population distribution are usually Sika dn NA Ld nd coat. 
tributions of some psychological measures for 
do display nonnormal characteristics. Many 


as follows: 


Dimensions of Correspondence 229 


1. For the slight to moderate departures from normality ordinarily en- 
countered in practice, the effect on the sampling distributions of t and F, the 
multiple-sample i-statistics which compare sample means through formulas 
(12.2) and (12.4), is negligible when there are ten or more values per sample. 

2. For the departures from normality encountered in practice, the effect 
on the sampling distribution of f, the single-sample i-statistic computed by 
formula (12.1), is more serious than the effect on the multiple-sample t and 
F, but is negligible when there are thirty or more values in the sample. 

3. The eflect of moderate departures from normality on the sampling 
distribution of F, the ratio of two sample variances, as given by formula 
(12.5), is serious for very small samples, but is negligible for samples of fifty 
values or more. 

4. The effect of moderate degrees of nonnormality on the sampling 
distribution of t, as in formula (12.3), or of F, as in formula (12.6), computed 
from a sample, product-moment coefficient of correlation, is negligible when 
the sample contains thirty or more values. 

The arbitrary standards stated above obviously do not cover all situations. 
There is a vast amount of rational and empirical evidence concerning the 
effects of skewness and kurtosis on various sampling distributions, but we 
cannot do justice to the evidence here. While the standards suggested above 
are necessarily somewhat arbitrary, they have been adopted only after very 
Serious study of the available evidence. There are some conspicuous gaps in 
the evidence on the effects of nonnormality. For example,we have not found 
evidence concerning the effect on the sampling distribution of 8, as computed 
from two sample correlation coefficients by formula (12.7). 

When we say that the sampling distribution of a given i-statistic.is affected 
by nonnormality, we mean that the shaded areas in curves such as those of 
Figures 12.9, 12.10, and 12.11 may, under conditions of nonnormality in the 
Population distribution, not be exactly what we expect them to be under the 
ideal, theoretical conditions represented by Tables A and B. When we say 
that the effects are negligible, we mean that the shaded areas, as a proportion 
Of the total area, may deviate from 0.05, but will lie between 0.04 and 0.06. 
Again the student must be cautioned not to interpret these values as absolutes. 
In our judgment, the effect would be serious if the shaded areas constituted 
as much as 0.10 of the total area when the ideal, theoretical value was 0.05. 

As it was described in Chapter 11, theoretical random sampling involved 
determining, by mathematical deduction, the nature of the sampling distribu- 
tion of a given i-statistic. Notice that a random sample was defined as any 
logically possible set of n values. It was not defined as a set of numbers con- 
Sisting of typical values or as any particular combination of values. The 
sampling distribution of some given i-statistic is itself a frequency distribution 
of the values of that i-statistic computed for each one of all possible, logically 
determined samples. What, then, is random sampling in practice? Is there 
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some set of concrete operations which deserve to be labeled as random 
sampling procedures and distinguished from other operations? 

At the risk of seeming dogmatic, we shall define practical random sampling 
as selection by means of a tested mechanical system. The system involves 
concrete operations which are instituted and managed by the investigator. 
Before the system is ever used in research, it is tested to determine whether or 
not it will produce empirically, for a given i-statistic, a sampling distribution 
which corresponds to the mathematically deduced, theoretical sampling 
distribution. If there is substantial evidence of correspondence between the 
empirical sampling distribution and the theoretical sampling distribution, the 
mechanical system passes the test. 

In real sampling, as distinguished from theoretical sampling, the selections 
are made from a population consisting of a large but finite number of existent 
and accessible units which can be measured. In psychological research, the 
unit of the population is a person, an animal, a behavioral event, or à 
behavioral product. The number of units selected is usually small relative to 
the number in the population. The population is real. That is, it exists and 


the units of the population are all equally accessible to the investigator doing 
the selecting. 


The Operations of Real Sampling 


The mechanical system most widely used for selecting units from a popula- 
tion is one whose central feature is a so-called table of random numbers. A 
table of random numbers is a collection of digits with values from 0 to 9. 
The collection is usually generated, initially, by some mechanical device, such 
as a lottery, a die, or a roulette wheel, or by similar electronic equipment. The 
table is tested to determine whether it can be used to generate one or more 
appropriate sampling distributions. If repeated use of the table does produce 
an approximation to the desired distribution, then use of the table for 
sampling purposes is considered acceptable. That is, the correspondence 
between theoretical sampling and practical sampling is taken to be confirmed. 

Broadly viewed, the system of real sampling has three phases: (1) units in 
the population are enumerated ; (2) numbers are read from a table of random 
digits; and (3) units in the population are identified. The specific steps in 
these three phases are given below. | 

1. Enumeration. Assign ordinal numbers to the units of the population in 
any arbitrary fashion. For example 


i : » if the population consisted of 9,253 
students listed in a university directory, 


0000 to 9,252 could be written beside the 
ment may be arbitrary, 


each of the ordinal numbers from 
n name of a student. Since the assign- 
the simplest procedure would be the best. Thus one 
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could begin at the beginning of the alphabetized directory and assign 
successive numbers to the names in alphabetical order. 

2. Reading. Decide in advance the direction in which you will read numbers 
from the table of random digits and what you will do at the end of an array, 
at the end of a page, and at the end of the table. Choose a starting point in 
the table by some mechanical means. For example, a die could be used. The 
die could be thrown several times to indicate the starting page, row, and 
column. Next read groups of digits of a size appropriate to your needs. For 
example, if the population consists of 9,253 students enumerated from 0000 
to 9,252, then you should read groups of four digits. It is usually advisable 
to record these groups in the order of their occurrence in the table. Read and 
record as many groups of digits as there are to be subjects in the sample. If 
the sample is to consist of 20 subjects, then it is necessary to read and record 
20 groups of four digits from the table of random numbers. Numbers which 
recur and numbers which do not appear in the enumeration can be ignored. 

3. Identification and selection. Take each group of digits and identify the 
unit bearing the same number in the enumerated population. Referring again 
to the population of 9,253 students, if one group of digits is 0389, look for 
the student whose name appears opposite that same ordinal number in the 
enumeration. He would be the 390" in the listing of the directory. Thus each 
group of digits identifies a subject for the sample. 


A DEMONSTRATION OF REAL SAMPLING 


Because of the importance of this system for obtaining samples and 
because students often have difficulty finding an account of the system, we 
Shall go to some lengths to describe and discuss a demonstration of it. 

We might demonstrate real sampling in any one of three ways: 

1. Using a large group of real subjects as the population. 

Example; Let the population consist of all male undergraduates listed in 
the student directory of University M. These students can be enumerated by 
writing ordinal numbers opposite their names. A sample of n names can be 
obtained by means of a table of random digits. Students whose names are 
selected are measured on a response variable, X. An i-statistic is computed 
from the sample of real measures. 

If it is desired, the sampling can be repeated many times. Each time, the 
value of the specified i-statistic can be computed. These values of the 
i-statistic can be arranged in a frequency distribution. This frequency 
distribution is actually an empirical sampling distribution for the specified 
statistic. 

2. Using a frequency distribution of real measures as the population. 

Example: Let the population consist of scores obtained from the files of 
the admissions office for students admitted to University M in September 
1960. Ordinal numbers are assigned to these scores. A sample of n scores is 
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obtained by means of a table of random digits. An i-statistic is computed from 
the sample of n scores. " . 

If it was desired, the sampling could be repeated and an empirical sampling 
distribution for the i-statistic could be established. . 

3. Using an artificial frequency distribution of values as the population. 
The student should understand that we are concerned at this point with the 
possibility of demonstrating real sampling. It is entirely possible to do real 
sampling from an artificial population. Of course, sampling from an artificial 
population is only of interest as a demonstration, whereas sampling from a 
large group of real students might be an integral part of a serious research 
design. As a matter of convenience in providing a demonstration of real 
sampling in this text, we shall employ an artificial frequency distribution of 
values as the population. . 

Example: An artificial population distribution of 10,000 values is given in 
Table 13.1. The value of the variable X is given in the first column. Score 


Table 13.1 
AN ARTIFICIAL POPULATION OF 10,000 VALUES 
X f Enumeration? 
9 30 0000-0029 
8 150 0030-0179 
7 630 0180-0809 
6 1,610 0810-2,419 
5 2,580 2,420-4,999 
4 2,580 5,000-7,579 
3 1,610 7,580-9,189 
2 630 9,190-9,819 
1 150 9,820-9,969 
0 30 9,970-9,999 


N = £f = 10,000. 
“Ordinal numbers have been assigned to the values in each score interval. 
intervals run from 0 to 9. The fre 


given in the second column, The 
been carried out as indicated in t 


quency for each of the 10 score intervals is 


2,500 digits arranged in 50 
Before we choose a start 
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a direction for reading. There are many legitimate ways of reading. On this 
Occasion, we choose to read down a column. When we reach the bottom of 
a column, we shall go to the top of the next column to the right and again 
read down. When we reach the end of a page, we shall go to the top of the 
first column of the next page. If we should reach the end of the table, we shall 


£0 to the top of the first column of the first page. 


Table 13.2 
PAGE 315 OF THE RAND TABLE OF RANDOM DIGITS« 


04053 17715 43400 34522 68089 30355 20322 52333 76076 91727 
31727 22402 11241 61640 02827 50795 70134 16582 69014 55896 
48972 14109 04845 34052 01783 00579 14707 56638 59995 56408 
35099 31960 51506 39727 29550 88060 21648 23421 43386 41416 
20483 88049 17811 70031 02221 30110 77697 63966 94679 10815 


68653 97277 62431 72251 36814 43076 68312 89337 68758 54435 
27371 05095 $7439 71320 85039 64115 81106 90143 48199 60039 
41026 60713 56004 82914 00715 83161 76961 58121 41355 06459 
06875 44626 99143 25641 71349 14100 49379 13105 36046 28527 
64218 57810 15692 15082 68878 08555 01203 61016 74250 64415 


52402 60583 71761 33176 75982 16737 28581 99542 06190 33634 
68304 38817 19251 13560 11777 05110 87031 23711 79992 13079 
23596 97483 30567 48193 72614 02713 63404 56120 92247 65664 
21925 00748 67909 76732 28409 60393 88008 24908 75505 25513 
32992 15761 19539 37828 11066 07225 00852 57776 71082 43426 


02937 37434 44330 47023 30645 72908 27689 81053 3443 23774 
39030 04243 06733 98748 25280 27477 27120 59275 50174 10209 
15868 20232 81938 86723 26893 36819 38430 68672 86559 73539 
39142 09183 85563 42090 32553 61305 35799 06371 90528 03077 
28809 31863 42937 34254 51279 72123 91317 44412 — 12396 07558 


11279 04679 55813 59654 90215 45054 10847 62762 
27400 21566 98177 23154 81545 43045 27143 44262 
24130 41836 54027 38882 49655 14151 93316 92291 
74420 83185 82692 73823 88512 65023 31227 68763 
64062 05593 00297 98838 58142 72612 53560 53124 


81185 97328 27471 78030 72060 02589 46817 24717 
15002 92991 95177 36298 16537 88473 64426 50984 
72830 98733 74928 01880 99084 20028 46629 62639 
75949 73331 34178 46882 22794 30529 
83753 27700 41183 93953 62126 78326 83809 69886 
"Lines of the original RAND table are numbered. These numbers have been omitted 


from the table as it is reproduced above. Adapted with permission from the RAND 
Corporation, A Million Random Digits (Glencoe, Ill.: The Free Press, 1955), p. 315. 
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Table 13.2 
PAGE 315 OF THE RAND TABLE OF RANDOM DIGITS (continued) 


25618 40076 81223 34072 58189 09393 38528 09972 55984 77363 
64302 92307 15246 15004 45925 65448 61462 38861 18612 90501 
26803 58650 80410 04526 99352 25352 56848 25675 88086 89600 
86214 24449 37100 30921 16325 65987 07534 24678 23150 20902 
10817 66999 12056 22844 03847 33336 52092 2425| 49702 24384 


22677] 51371 47284 22063 07727 31379 90462 65657 41798 00212 
19193 79282 30388 60077 25370 03559 03151 61731 42763 24632 
19052 65518 — 36005 78326 81155 97428 98679 15903 34641 51083 
03231 80995 66618 13905 90549 29499 88947 66575 02525 08398 
88063 16412 99869 38663 82026 06803 18966 78992 21280 29960 


26529 35993 24584 00452 11108 75438 06078 98785 38040 93746 
84241 57374 39160 27898 27412 35091 06716 78165 24196 12540 
30581 12894 02779 03668 48223 88050 00591 07738 87437 60512 
13719 64755 29962 32118 18738 30514 33012 89132 84340 97263 
14983 13884 48457 81853 74278 33136 20349 71379 97447 42103 


47762 02694 52343 62711 09088 35045 75288 00758 76053 60504 
26793 37177 06115 74306 79880 93933 64200 06100 40161 08666 
Ee 61888 — 93496 34850 97094 50528 26382 64887 71587 2195! 
= 22003 38235 87426 44051 03441 65669 16972 30122 16988 

3511 58403 56003 22978 30781 72495 67801 67423 20297 


We now proceed with the selection of a starting point. There are 400 pages 
of random digits in the RAN 

sections of 80 
E 


o the four pages: 313, 314, 315, and 
Which we ignore. Another throw yields 
starting point, 


pe oru our starting point. Five major columns 
can be distinguished A throw of the die produces a ** 1” and en us in the 
à UE 10 major rows can be divided into five 
pairs. A throw of the die yields a “ 3 ^ and puts us in the middle pair of the 

as now been narrowed down to the 
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arrangement of digits set apart by the rectangle of dotted lines in Table 13.2. 
There are two major columns in this arrangement. Let the left column be 
designated “ odd " and the right column, * even." A throw of the die yields 
a *2" and we choose the right column. There are two major rows in the 
column. Let the upper row be “ odd " and the lower row be “ even." The die 
says " 3" and we choose the upper row. There remains now a block of digits 
with five rows and five columns. A throw of the die for the column gives a 

3." Another throw for the row gives a “ 2" and the starting point is finally 
determined. The digit which serves as à starting point is a “ 7 ? in the twenty- 
second row and the eighth column of the 50 x 50 arrangement on page 315 
of the RAND table. 

Let us suppose that we want à sample of 10 values from the population of 
Table 13.1. We require 10 groups of four digits from the table of random 
digits, starting with the “ 7." Reading down the column yields the following 
10 groups: 7502, 2733, 8036, 4932, 5949, 3878, 6180, 5100, 6479, and 1218. 

By means of these 10 numbers, we can now identify 10 values of X in the 
population distribution of Table 13.1. The first number read from the table 
of random digits is 7502. In the distribution of Table 13.1, the value of X 
numbered 7,502 is 4. The second number is 2733. The value of X numbered 
2,733 is 5. Thus we can determine 10 values of X to which we were directed 
by the 10 groups of digits read from the table of random digits. Table 13.3 


Table 13.3 


A RANDOM SAMPLE OF TEN VALUES 
FROM THE POPULATION IN TABLE 13.1 


Corresponding 


Random 
Values of X 


Numbers 
7502 
2733 
8036 
4932 
5949 
3878 
6180 
5100 
6479 
1218 


OV PRORA UU UR 


gives the 10 groups of four digits from the table of random digits and the 
Corresponding values of X from the population in Table 13.1. The 10 values 
of X constitute a real sample from the artificial population. 

Having obtained a random sample of 10 values in this demonstration, we 
shall compute three i-statistics from those values. Among the i-statistics 
discussed in Chapter 12, there are three which can be computed from a single 


236 Practical Random Sampling 


sample of values of X. The three possibilities include M,, the sample mean; 
Vi, the sample variance computed with degrees of freedom; and t, a ratio 
computed by formula (12.1). To complete the demonstration, we shall com- 
pute 7. In the process of computing f, we shall have to compute the other two: 
M. and V}. Formula (12.1) is 


Ms — gk 
i JVin 
The required values can be computed, as follows: 
ZX 44 
Ma moms 
" , (ax? 44° 
Zx = LX? — A = = = 
x k 200 10 = 6.4; 
pr Zx? 6.4 
= ooh agr 0.711; and 
e =45, 


Substituting these values in formula (12.1) yields the following result: 


M,—p 44—45 
t= Vn = — = 0.375. 
JV 40.711/10 

Bp value of t computed for a random sample from the population in Table 
«l is —0.375. This completes the demonstration of the procedure for 

obtaining a single random sample. 
A to generate an empirical sampling distribution of t, we would 
LI c eve ofa sample many times and compute for each sample a 
his he 2) e requency distribution of a very large number of values of t, 
ined, would constitute an empirical sampling distribution, the counter- 
Aes ofa theoretical ¢ function. We shall not attempt here to generate a com- 
^ am Sé ien but, to illustrate the results of repeated sampling 
is ice i have selected four additional samples of 10 values from 
Ed m M NL able 13.1. Table 13.4 shows the numbers read from page 
eii table and the corresponding values of X from the popula- 
- The student can verify that the random numbers for the four additional 
samples follow those of the first sample in the RAND table. Table 13.5 con- 
tains values of Ma, Vas and t for each of the five samples. Figure 13.1 shows 
E distribution of the five values of t obtained from the five random 
jaa teh e INN verify the computation of the ¢ values from the 
It is important to understand that each of the five sets of values discussed 
above qualify as random samples because of the method used in selecting the 
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values and not because of any characteristics of the values themselves. Like- 
wise, the variability of the t values in Figure 13.1 is random variability because 
of the method used in selecting and not because of any characteristics of the 
Table 13.4 
FOUR ADDITIONAL RANDOM SAMPLES^ 


Sample 2 Sample 3 

Random Corresponding Random Corresponding 
Numbers Values of X Numbers Values of X 

1846 6 2909 5 

3438 5 7536 4 

6895 4 0373 7 

5488 4 8143 3 

9937 1 2339 6 

0549 7 6795 4 

7819 3 8496 3 

1979 6 7670 3 

5897 4 9912 1 

8015 3 8523 3 

Sample 4 Sample 5 

Random Corresponding Random Corresponding 
Numbers Values of X Numbers Values of X 

4454 5 1831 6 

4783 5 4336 5 

1410 6 9230 2 

5168 4 2450 5 

5917 4 9353 2 

1361 6 1417 6 

4088 5 2769 5 

4122 5 5190 4 

7681 3 7946 3 

7988 3 1521 6 


“Population in Table 13.1. 


Table 13.5 


VALUES OF Mx, Vi, AND t FOR THE FIVE RANDOM SAMPLES" 


Sample 
1 2 3 4 5 
M. 44 4.3 3.9 4.6 44 
Vi 0.711 3.122 2.989 1.156 2.489 
t —0.375 —0.358 — 1.097 0.294 — 0.200 


"Population in Table 13.1; sample values in Table 13.4. 
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Fig.13.1 The distribution of t values computed from the five random samples in Tables 
13.2 and 13.3. (Values have been grouped into intervals of 0.1.) 


research involves the selectio 


population of organisms, behavioral events, 
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of the artificial population. They also sometimes mistakenly conclude that the 
purpose of a demonstration is simply to express and to dramatize an assump- 
tion. This is not true. We repeat that the purpose is to direct attention to and 
explain the use of a table of random digits in the set of concrete operations 
which should be employed for sampling in psychological research. 


The Random Interval Procedure 


A table of random digits can be used correctly in a variety of ways, too 
numerous to list or discuss here. One variation in procedure deserves 
mention, however, because of its convenience in certain situations. One can 
sometimes avoid the burdensome task of actually enumerating the units of 
the population. For example, suppose one had a list of 5,000 voters and 
wanted a sample constituting 20 per cent of the total number. Taking every 
fifth name in the list would yield the desired number of subjects, but such 
à procedure is not acceptable because it cannot be subjected to a general test 
of randomness and, on any given occasion, it could be quite unsatisfactory. 
Instead of every fifth name, however, one might use a random interval, one 
determined by the succession of digits in a tested table. 

The random interval procedure involves the following steps: Adopt a 
direction for reading and choose a starting point in the table of random digits. 
Agree to read the digit “0” as “ 10." Read the first digit and count that 
number of names beginning with the first name in the list. The last name 
Counted goes in the sample. Read the second digit and count again, starting 
with the next name in the list. The last name counted goes in the sample. 
Read the third digit from the table, count names, and so on. The number of 
names skipped will vary from 0 to 9, but, over a large number of selections, 
will average very close to four names, thus insuring that approximately one 
fifth of the names will be chosen. The procedure can be adapted to other 
Population sizes and sample sizes by reading some combinations of digits and 


disregarding other combinations. 


Other Ways of Distributing Variability Randomly 


There are two situations in which the investigator does not actually select 
à sample from a population but can and does insure the random distribution 
Of certain kinds of variability over values he wishes to compute. When either 
Of these two situations obtain, it is considered quite acceptable statistical 
Practice to treat values computed from the collection of observations as 
i-statistics, that is, as values computed from random samples. The student 
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will undoubtedly experience some difficulty in perceiving any equivalence in 
these several approaches. One reason is that the equivalence is not exact. 
Another reason is that the equivalence of the actual procedures cannot be 
deduced mathematically. Another is that the common acceptance and 
justification of the equivalence are, to some extent, based on experience. 
Finally, there is a certain exercise of common sense and argument here which 
the student will not likely be prepared to grasp immediately. 

The first situation is one in which the investigator has a group of subjects 
which he wishes to divide into two (or more) subgroups so that the final 
difference between the subgroups constitutes a random event. He makes the 
division by sorting names into subgroups, with the sorting guided by a succes- 
sion of numbers read from a table of random digits. If he wishes to sort into 
two subgroups, he may read digits as “ odd " and “ 
names in the list into odd and even subgroups, as 
of digits. If he wishes to sort into three subgroups, he may read single digits, 
ignoring all but I's, 2’s, and 3's, and sort Successive names into groups 
numbered “ 1,” “ 2,” and “ 3.” Since there is nothing to do when all but one 
subgroup is filled but put the remaining names into that subgroup, it is 
necessary to randomize the original order in which names are listed. If 
the names are put on cards, then the cards could be well shuffled initially and 
then actually sorted into piles as dictated by the random digits. 

When a group of subjects is divided randomly into subgroups, these sub- 
Broups are often treated as random samples and values computed from them 


are considered to be i-statistics, One may reasonably ask, What constitutes 
the population in this case? There is no population in the sense implied by the 
original definition of the word. There is a Population, however, in the sense 
of a totality of logically i 


of n subjects each. For example, if N = 20, c = 2, and n = 10, then the total 


number of logically possible arrangements is 20!/10!10! — 184,756.! There 


has been some Study of the relation between sampling distributions for 
i-statistics based on 


even,” and sort successive 
dictated by the succession 


any one arrangement can 
that all arrangements are 


cies fly explained in Chapter 2. Example: 5! Sx 4x 3x 2X 
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measurement, each of which is associated with an error of some unknown 
magnitude. It is the investigator's desire to distribute errors associated with 
occasions of measurement randomly over the subjects to be measured. To do 
50, he assigns ordinal numbers to the subjects arbitrarily and then determines 
the order of occurrence of these numbers in a table of random digits. Their 
order of occurrence is the order in which he measures the subjects. Usually 
the research requires at least one repetition of the whole procedure. Certain 
values computed from the entire collection of observations qualify as i- 
Statistics. Here the population is the totality of orders into which n objects 
can be placed. The number is 7! The population of orders is real in the sense 
that any one of them can occur, but it is not accessible, in the sense that all 
orders are simultaneously available. 


Incorrect Practice 


When a tested mechanical device like a table of random digits is employed, 
random selection, random sorting, and random ordering, as well as com- 
binations of these, constitute the admissible procedures for insuring semantical 
Correspondence between theoretical sampling and practical sampling. What 
àre the consequences of failure in this correspondence? When the procedure 
used has not been tested or when the investigator has done nothing to meet 
this requirement, then the consequences are unpredictable. Unintended and 
unwanted differences in the observations, and in the values computed from 
them, may be small or large, may operate in one direction or another, and 
may be consistent or inconsistent. One would not know what to expect. The 
Shaded areas in the tails of the distributions of Figures 12.9, 12.10, and 12.11, 
and the form of those distributions, would be indeterminate. As a matter of 
fact, it would be more nearly correct to say these distributions do not then 
exist, 

Because of the drastic consequences whe : 
aspect of the problem of correspondence, the investigator should place this 
Issue first among his concerns. Since departures from normality in the dis- 
tribution of errors may not be serious, if he can insure that the unintended 
Variability in his observations is random, he has taken a giant step toward 
Insuring correspondence between sampling theory and practice. If he can also 
insure that discontinuity and other departures from normality are negligible, 
SO much the better. The frequent practice of ignoring the requirement that 
Variability be distributed randomly and, at the same time, investing the prob- 
lem of nonnormality with exaggerated importance cannot be defended. 

We have something more to say about this problem of distributing 
variability randomly. We believe it is incorrect and indefensible for a psycho- 


n there is a failure in the sampling 
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logist to call a set of measures a random sample when he has done m 
to insure that the set of measures has the properties of a random sample. 5s 
may try to justify himself by saying that the set of measures can be ny ii 
of as a random sample from some hypothetical population. In doing so, He i 
referring to the particular values of the set and is considering them as w 
of some imaginary population. He is mistaken in defining a random samp 
in terms of the values in it. He is completely disregarding the method o 

obtaining the values. Actually the populations which could give rise to a 
sample containing certain specified values are innumerable, A random 
sample, theoretical or practical, is defined by the procedures of selection. 
Calling a set of measures a random sample, when nothing has been done to 


i ; is 
insure that the set of nieasures has been selected as it should have been, 
simply unrealistic, 


Transformations 


Certain kinds and de 
by applying one of s 


BRL à s 
statistics are computed. In Chapter 4, several examples were given of set 


hose distributions were asymmetrical originally, but became 
symmetrical wh 


y is such that a proper trans- 


formation will improve the distribution and bring about closer correspondence 


between theory and practice., 
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requirement of a normal distribution is not so serious as to preclude the use 
of sampling theory in the evaluation of research. With respect to the second 
part of the problem, sampling distributions do not exist when there are no 
operations which insure that variability is randomly distributed. Our con- 
clusion is that sampling theory does not apply when there is a failure in the 
requirement that variability be randomly distributed. 
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STATISTICAL INFERENCE 
AND RESEARCH 


Pragmatics | 14 


The research psychologist can never be certain that his observations of 
behavior are uncontaminated by error. No matter how careful he may be 
in planning and conducting a study, a multitude of influences, unintended and 
unwanted by the researcher, produces in his data effects of unknown magni- 
tude. These unintended differences or biases make interpretation of the results 


of research quite difficult and introduce into the interpretation a degree of 
uncertainty which cannot be eliminated. 


course, be satisfied with thi 
knowledge in the form of ge 

Inductive inference, 
from the individual ev 
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inference cannot be accomplished except through statistical procedures. 
Physical scientists have not been dependent on statistics for the high order of 
the inductive inferences which have characterized their research, There is a 
great deal more to research and the acquisition of knowledge than statistical 
manipulation of data. There is little doubt, however, that, in the analysis of 
data and in certain formal aspects of research design, the methods of 
Statistical inference represent the most rigorous logic men have yet been able 
to achieve in this connection. . 

It is, of course, in the very nature of inference that it is accompanied by 
uncertainty and that it is subject to error. Statistical inference shares these 
faults. There is a sense, however, in which statistical inference can be said to 
be rigorous. In any given investigation, the probability of an incorrect 
inference is known and stated. a" 

Before we become involved in the specific applications of statistical inference 
in Psychological research, it would be well to consider two distinctions which 
àre extremely important for an understanding of these applications. We can 
distinguish two kinds of procedures in statistical inference: the testing of 
Significance and the estimating of population values or parameters. We can 
also distinguish two kinds of research: experimental research and empirical 
research. These terms will be defined and discussed later at length. At the 
Moment, it would be appropriate to examine the relation between the 
Statistical procedures and the two kinds of research. 

Since the two statistical procedures can be employed in the evaluation of 
either kind of research, the subject matter of statistical inference and its 
applications in research can be divided into four topics representing the four 
Possible combinations of procedure and kind of research. These four topics 
are given by the following brief outline: 


A. Experimental research 

1. Testing significance 

2. Estimating parameters 
B. Empirical research 

1. Testing significance 

2. Estimating parameters 


Although the relative importance of these four topics might be ramp 
it is our opinion that a careful survey of the actual research being done by 
Psychologists and the kinds of statistical procedures being used in the evalua- 
tion of that research would reveal that testing significance occurs much more 
Often than estimating parameters. There is a reasonable explanation for this 
ne-sided emphasis on testing significance and some comment will be mies 

Y Way of explanation of the situation in Chapter 16. Furthermore, althoug 
here is probably some general inclination for research psychologists to view 
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experimental research as being more important than nonexperimental. re- 
search, we take the position here that both are important and merit eee 
as long as they are clearly distinguished. Under the assumption that both 
kinds of research are important, but that testing of significance is generally 
of more interest to the research psychologist than estimating parameters, we 
shall emphasize the testing of significance in separate discussions of experi- 
mental and empirical research. Testing significance in experimental —— 
will be taken up in Chapter 15; testing significance in empirical research wil 
be considered in Chapter 16. . 

Although the testing of significance will be emphasized, we do believe that 
the student should be introduced to estimation procedures. Our choice of 
examples for an introduction to estimation places the discussion in Chapter 
16, which deals with empirical research. As we have indicated, estimation has 
many applications in experimental research, but space does not permit us to 
pursue the topic that far. Consequently, nothing will be said about estimation 
in Chapter 15, which deals with experimental research. 

In labeling the two kinds of 
we do not mean to imply that ex 
Acquisition of experience thro 
of both kinds of research. Acc: 
planned investigation with the 
fication or measurement, 


issue can be expressed in terms which make 
obvious the nature of a correct resolution of that issue, one is then on firm 
ground. If a research issue remains vague in one's thought and expression. 
and if its resolution is largely a matter of conjecture, then there is little chance 
for success, Rigorous thought and language should be the immediate 
objective of the Student who hopes to become à research psychologist. 


of the term “ experiment." Some of the worst 
would-be research 


There is much loose usage 
usage comes about when a 
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If everything is an experiment, there is no need for the term. It becomes 
synonymous with “ research." An important discrimination is thereby lost. 

There is a line of thought which connects experimental research with the 
determination of causes. While this usage comes very close to the important 
distinction on which we have been insisting, it ultimately runs afoul of the 
problem of defining * cause ” and the many philosophical issues which that 
Problem entails. Consequently, the word “ cause " becomes a liability rather 
than an asset in making explicit the difference between the two kinds of 
research. 

The distinction between experimental and empirical research is actually a 
simple one and its importance is obvious when one takes a direct and straight- 
forward view of the research situation. The research psychologist engages in 
One or the other of two clearly distinguishable kinds of operations. In the one 
kind of operation, he installs himself at some vantage point and observes the 
behavior of human or animal subjects in an environment whose character- 
istics he may also record. The information he acquires from observing may 
turn out to have broad generality and may be of great practical consequence. 
In the other kind of operation, he controls certain conditions, manipulates 
One or more others, and observes the effect of the manipulation on the 
behavior of his subjects. Again the information he acquires may possess 
8enerality and practical value. As we indicated earlier, the first operation 
characterizes empirical research; the second operation characterizes 
Experimental research. : 
_ As long as the psychologist reports faithfully what he did and as long as his 
Interpretation of his results is appropriate to what he did, there is no problem. 


The trouble begins when he engages in the first operation and tries to interpret 


his results as indicating what would be found out from the second operation. 
han observe, but draws con- 


That is, he observes, and does nothing more than obs on 
Clusions from his data as to what would happen if he intervened. Now it is 


Obvious that the most certain knowledge about intervention is going to come 
rvention takes place. When the researcher 


a conclusion about the effects of interven- 
ffect some manipula- 


from investigations in which inte 
does nothing more than observe, 
tion can be a very bad guess. If he wishes to learn what e 
tion of conditions has on the behavior of subjects, he should perform exactly 
that manipulation, Even if he actually does manipulate conditions, the results 
Will entail some degree of uncertainty because of error, but if he does not 
Manipulate, the prospect is far worse. C 
The controversy over the connection between lung cancer and smoking is 
àn excellent example of this issue. There seems to be little doubt that scientists 
have observed a relation between the incidence of lung cancer and smoking. 
hat the nature of this relation is, remains quite uncertain. What they have 
Not observed is the effect of manipulating smoking. To be exact, they have 
Observed that the incidence of lung cancer is greater among smokers than 
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among nonsmokers. They have not observed that the incidence mpi peu 
increases among nonsmokers when they take up smoking. Nor be when 
observed that the incidence of lung cancer decreases among smo on ane 
they quit smoking. They have not shown that young people could avo wee 
cancer by not smoking and that the same young people would contrac es 
cancer if they did smoke. They have not shown that elimination of smo 
would eliminate the lung cancer now known to be associated with it. i 

To sum up, the researcher may observe or he may manipulate and s 
Observation alone may yield valuable information. However, if he wish 
to know the effect of manipulation, he must do it. He cannot claim to know 
the effect of manipulation until he does. 4 

There is one further complication in the issue. Although one does not fin 
out about manipulation by simply observing, it is entirely possible that an 
observed relation will be confirmed in experimental manipulation. si 
that one has observed a relation between two variables—an environmenta 
condition and some aspect of behavior. Is it possible that changing the cone 
dition would change the behavior? Yes, but the experimental demonstration 
remains to be done. Thus the observed relation c 
an experiment in which it is dem 
change the behavior. 

Here is an example. Scientists observ. 
among children in communit 
fluorides is lower than the in 
not contain those chemicals. 
of tooth decay and presence 
they know that introducing 
decay? No, it does not. It 
associated with the natural f 
such an action will have the 


ould serve as a pretext n 
onstrated that changing the condition doe: 


€ that the incidence of tooth bn 
ies having water supplies which contain natura 
cidence in communities whose water supplies do 
A relation is thus established between incidence 


T" , DOTT 
We have chosen “ empirical research ” as the label for those investigation 
: : e- 
we shall reserve “ experimental r 


in which the psychologist actually 
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uncommon to encounter a student who expresses a preference for a particular 
i-statistic because, as he says, he wants his study to be an experiment. In 
thinking that the choice of a particular i-statistic can make his study an 
experiment, he has failed to understand the difference between an experiment 
and an empirical investigation. A given i-statistic may be appropriate for an 
evaluation of either kind of research. The beginning student would probably 
be astonished if we were to describe here all of the equivalent methods of 
analysis which can be applied equally well to experimental and empirical 
research data. 

Although the choice of method of analysis does not determine whether the 
research is experimental, appearances might give the student the impression 
that it does. By appearances, we mean that there is some tendency for experi- 
mental data to be evaluated by certain methods while nonexperimental data 
are evaluated by other methods. There is some tendency for experiments to 
be evaluated by means of i-statistics, such as ¢ and F, computed from means 
and variances. There is some tendency for nonexperimental research to be 


evaulated by the same i-statistics computed from correlation coefficients and 
Other measures of association. These tendencies reflect superficial, rather than 
necessary, correspondences. 

The two major applications of statist 
estimating parameters, will be discusse 


are treated in detail in the following chapters. 


ical inference, testing significance and 
d next, in general terms, before they 


Testing Significance 


Observation and measurement in research are always subject to unintended 
and undesired influences which scientists call error. Consequently, in 
announcing positive results, the investigator may be mistaken no matter how 
favorable those results may appear to be. While the researcher cannot avoid 
being misled by outcomes due to error, he can be selective or discriminating 
With respect to those outcomes in two Ways: First, within the totality of 
Strictly error outcomes, he can define a class of the most favorable outcomes, 
by which he is willing to be misled. Second, he can place any desired limit 
9n the size of that class. His erroneous decisions to announce positive results 
Will then occur with a relative frequency which is known and can be stated. 


_ The evaluation of research data by application of a statistical test of 
Significance requires computation of an i-statistic which embodies a com- 
Parison of interest to the investigator. The comparison is a difference, a ratio, 
Or a combination of the two. To define a class of error outcomes about which 


he is willing to be mistaken and, further, to specify its size, the investigator 


Must be able to deduce the nature of all possible outcomes for the com- 
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parison. That is, he must know the sampling distribution of an i-statistic 
whose logical properties are relevant to his purposes. Within this sampling 
distribution he can establish two classes of values: error results which appear 
least favorable, or those values of the i-statistic which are smaller than some 
criterion value; and error results which appear most favorable, or those values 
of the i-statistic which are larger than the criterion value. In evaluating his 
research data, he computes the i-statistic which embodies the comparison of 
interest to him and observes the class into which the obtained value falls. If 
it exceeds the criterion value, it belongs in the class of error outcomes about 
which he is willing to be mistaken and he announces positive results. If it 1s 
exceeded by the criterion value, it belongs in the class of error outcomes by 
which he is not willing to be misled and he acknowledges negative results. 
The size of the class of error outcomes which appear most favorable and 
which will be the occasion for announcing positive results erroneously 1$ 
arbitrarily determined, but it seems reasonable and consistent with the pur- 
pose of scientific research to make the size of the class small rather than 
large. The size of the class is expressed as a proportion of the totality of error 
outcomes. A proportion of 0.05 has gained wide acceptance as being small 
enough to afford the researcher, his colleagues, and his public a reasonable 
degree of protection from false announcements of positive results. Some 
researchers, who are more demanding of themselves, adopt a proportion of 
0.01, thus reducing the size of the class of erroneous decisions considerably- 
i To further focus attention on the general characteristics of tests of 
significance, we shall give here an example in which the sampling distribution 
is identified but no details are given about the investigation itself. In the 
evaluation of a certain body of research data, the i-statistic computed was F, 
a variance ratio. The sampling distribution of F for this particular set of data 
could be represented by the function in Figure 12.10. Bear in mind that the 
sampling distribution is a specification of all of the possible error outcomes 


for the investigation. The point on the horizontal axis which separates the 
unshaded area under the curve from the shaded area represents the criterion 
value, F,. The shaded area re 


Y aded arca represents the class of error outcomes about which 
the investigator is willing to be mistaken. The unshaded area represents the 
class of error Outcomes about which he is not willing to be mistaken. Having 
compe * the investigator determines in which class it belongs. He does 
So by consulting a table of F such as Table B i ix. If the 
y or B, in the Appendix. I 
obtained F exceeds the ; a ^ 


ce. criterion value given by the table, the investigator 
announces positive results. If it does not, he acknowledges negative results. 


Classifying a given i-statistic as above or below some criterion value in the 
totality of error result: 


: $ I5 an action consistent with the viewpoint that the 
result is due to error, a viewpoint which is referred to formally as the nul 
hypothesis. When the obtained i-statistic exceeds the criterion value and 
positive results are announced, the investigator is said to reject the null 


Estimating Parameters 2al 


hypothesis. When the value fails to equal the criterion and negative results 
are acknowledged, the investigator is said to accept the null hypothesis. The 
proportion of error outcomes which are the occasions for announcing 
positive results or rejecting the null hypothesis is called the level of significance. 


Estimating Parameters 


" by: have said that values computed from a population are called parameters 
2n that values computed from one or more random samples are called 
Nieder. A Some values computed from samples correspond to values 
fee = from populations. For example, there is the mean ofa sample and 
eile ean of the population from which the sample was obtained. There is the 
inda e ofa sample and the variance of the population from which the 
of ple was obtained. There is the product-moment correlation for a sample 
paired values and the product-moment correlation for the population of 
Den values from which the sample was obtained. Population values or 
T NEE are designated by Greek letters. Sample values or i-statistics are 
Signated by Latin letters. 
h on real population is too large to 
faa o large to permit the computation 
oom researcher to obtain an estimate of o 
general terms, an attempt is made to in 
l-statistics. 
Nee of parameters are divided i 
nterval estimates. 
A point estimate is a single value whic! 


sa s : 
ra and which possesses certain proper 
e corresponding parameter. We shall consider only point estimates which 


e unbiased or approximately so. An i-statistic is an unbiased estimate of a 
ee when the mean of its sampling distribution is equal to the para- 
pis r. It has been our experience that the beginning student is usually not 
ie for this definition. “ Unbiased ” usually has connotations for the 
h ent which go beyond the somewhat limited and arbitrary definition of 
* statistician. Examples of unbiased estimates will be given in Chapter 16. 
mend interval estimate consists of a pair of values called limits, which are 
P don from one or more samples. These limits, which establish an interval, 
Bie sampling distribution. The interval has the property that it will some- 
iste MEQUE the parameter and will sometimes fail to do so. In the 
fail. Th of such intervals, some known and stated proportion of intervals will 
small € size of this proportion can be fixed by the researcher and made as 
as he desires. The size of the interval depends on the chosen proportion 


measure and describe, that is, when 
of parameters, it is often of interest 
ne or more of the parameters. In 
fer values of parameters from 


nto two broad classes: point estimates 


h is computed from one or more 
ties which qualify it as an estimate 


252 Statistical Inference and Research 


of failures. The smaller the proportion of failures, the larger the interval. 
Several interval estimates will be described in Chapter 16. E of 

We shall comment here briefly on our reasons for emphasizing pe 
significance. In testing significance, when an investigator rejects t a a 
hypothesis and announces positive results, he is under no obligation tor um 
the size of the obtained difference or ratio in terms of the original scale units. 
On the other hand, in establishing an estimate of a parameter, it is in the eni 
nature of the procedure that the size of the computed value is indicated. 
When the estimate is expressed in terms of units on a psychological scale, its 
size may be difficult to interpret because of the unknown differences among 
scales, even those intended to measure the same characteristic of Lic tpe 
Consequently, psychologists are not prone to emphasize the magnitude o 
differences, ratios, or other values, expressed in terms of original scale units. 
Estimation is more likely to be used when the value of interest to the 
researcher is a pure or dimensionless number like a proportion, a percentage, 
or a correlation coefficient. 

There is another reason for the em 


psychological research. Much of 
control 


phasis on tests of significance Hu 
PSychological research is exploratory; 
s frequently of dubious quality. To 
nder these circumstances is itself no 
estimation of the size of a parameter 
will come when measurement, predic- 
arch will improve to the point that the 
ficance to estimating parameters. 


accomplishment. Attempting the 
is often not justified. Perhaps the day 
tion, and control in psychological rese; 
emphasis will shift from testing signi 


eee f A al 
Sts of significance in experimente, 
deals with tests of significance and estimates 


parameters in empirical research 


EXPERIMENTAL 
INFERENCE 


Pragmatics | 15 


An experiment is a planned event in which the researcher manipulates one 
or more variables and determines the effects of this action by making observa- 
tions on another variable. A manipulated variable is called an independent 
Variable. The variable on which observation is made is called a dependent 
Variable, Although either kind of variable, independent or dependent, may be 
qualitative or quantitative in nature, we shall limit our discussion to 
quantitative variables. 

In psychological experimentation, the manipulation of an independent 
variable takes a characteristic form. The independent variable is the feature 
Which varies in a set of conditions, often called treatments, and the manipula- 
tion is the actual imposition of these conditions on groups of subjects. A set 
of conditions may consist of a number of stimuli which vary along some 
Specific dimension, a number of different levels of some characteristic of the 
environment which can be controlled by the experimenter, varied allowances 
Of some consumable substance such as water, food, nutrients, or drugs, or 
Variations in tasks and programs prescribed by the experimenter for his 
Subjects, Listed below are four examples of independent variables described 
as sets of conditions. . . 

A Two auditory signals consisting of pure tones equal in intensity but 
differing in frequency. 

2. Three levels of illumination for a testing room. 

3. Three different concentrations of salt in solution in drinking water. 

4. Two different amounts of practice: five trials and ten trials. 

The dependent variable in a psychological experiment is a measure of 
response of the subjects in the experiment. The subjects may be human or 
animal, Typically, subjects differ in their responses, when they are measured 
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under uniform conditions, and it is not uncommon for these differences to 
persist in repeated measurement. This is to say that pyschologists frequently 
encounter evidence of stable or reproducible differences among their subjects. 
Because subjects differ, the selection of subjects for the conditions of an 
experiment can bias the final comparisons. Thus differences among subjects 
constitute a major source of error in psychological experiments. 

Subject differences are not the only errors by which experiments can be 
biased. A multitude of influences affect the experimenter's observation ofa 
subject's response under a given condition at a specific time and place in the 
conduct of the experiment. The experimenter, of course, tries to control and 
hold constant as many of these influences as he can, and to whatever extent 
he can, so that unintended differences will be eliminated and only intended 
differences will occur in the data. Intended differences are those he is trying 
to produce by means of the varied conditions of the experiment. He can never 
be certain that these efforts to control extraneous influences and eliminate 
bias are completely successful. . 

The presence of error and bias in experimental data introduces uncertainty 
into the evaluation of the data. In announcing positive results for an experi- 
ment, the investigator can never be certain that he has not been deceived by 
error. The only kind of rigor he can achieve in his evaluation is the rigor of à 
test of significance in which the probability of an incorrect rejection of the 
null hypothesis is known and stated. Meaningful application of a test of 
significance requires, of course, that the distribution of errors have the 
attributes of random variation. 

Three simple plans for conducting and evaluating experiments are described 
below. Plans for experiments are often called experimental designs. 


The Completely Randomized Design 


Imagine an experiment in wh 
of two conditions, A, and Ay. 
on a group of subjects, after 
response variable, Y. 


ich we wish to determine the differential eien 
Each of the two conditions is to be impose 
Which the subjects are to be measured on 4 


ences among them will act as errors 
son of conditions in many different 
ects and on the particular arrange 
produced by the assignment. f 
cted (2n) subjects from some large supply ° 


subjects, that we have assigned them to two conditions with n subjects P*T 


The Completely Randomized Design 255 


condition, and that we have imposed the conditions and obtained measures 
on the dependent variable. We recognize that the particular arrangement of 
subjects which results from the selection and the assignment is only one of 
many possible arrangements. 

Table 15.1 shows the main features of the design and the way of recording 


Table 15.1 


AN EXPERIMENTAL DESIGN WITH TWO CONDITIONS 
AND TWO RANDOM SAMPLES 


Conditions 
Ai As 
First Second 
Sample Sample 


the data which consist of (21) measures. Each column of the table corresponds 
to a sample and the condition which was imposed on the subjects in that 
Sample, Notice there are no rows in the table. The double subscripts permit 
the identification of any measure in either column. 
a is obvious that the total variability in the (2n) observations will contain 
eis different kinds of differences. It will contain differences due to the 
a. of the treatments, to the extent there are such effects. It will contain 
= erences among subjects, to the extent that preexperimental inequalities of 
1 Ponse have persisted into the experiment. Finally, it will contain unin- 
ended differences arising from extraneous influences which the experimenter 
has failed to eliminate. The intended differences are óften called treatment 
Tror., 


effects. The unintended differences are often called e 

The total variability in the (27) observations can be analyzed into two 
Components: the between-sample sum of squares and the combined within- 
Sample sum of squares. It will be interesting to consider the relation between 
the two components produced by the arithmetic analysis of the total sum of 
Squares and the two sources of variability described above as treatment effects 
= error. Under certain circumstances to be described below, the treatment 
in will be contained in the between-sample variability and nothing but 

or will be contained in the within-sample variability. 

Treatment effects can be divided logically into two classes: those effects 
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which are constant and additive and those which are not. Let us paruis 
first possibility, the case of constant and additive effects. We shall E P 
to the other possibility. A treatment effect is constant and additive x es 
subjects in a particular condition of the experiment change with t F D 
increment or decrement of response. When the effects of conditions tet 
A, are constant and additive, but different, the differential treatment uy 
is contained in the between-sample variability and does not enter the withi 
sample variability. 

The conclusion that the differential effects of treatments will not be con- 
tained in the within-sample sums of squares can be justified in the following 
manner. It was established in Chapter 4, in connection with the discussion 
of coding transformations, that adding a constant to, or subtracting a con- 
stant from, each number in a set does not change the variability of that set. 
A constant additive treatment effect can be conceptualized as a constant value 
added to, or subtracted from, the response of each subject in a sample. Con- 
sequently, the variability among the measures of response in a sample will not 
be different from what it would have been had there been no treatment. That 
is to say, the variability within a sample will not have changed during the 
experiment. Since the total variability consists only of two components, the 
between-sample variability and the within-sample variability, the effects of 


treatments must be contained in the between-sample variability. 
The between-sample sum of s 


who have been treated differen 
constant and additive treatmen 
sample sum of squares may als 
and assignment of subjects. Bec 
the samples may not be exactl 


quares measures differences among subjects 
tly and, therefore, contains any differential 
t effects, but that is not all. The between- 
© contain a bias resulting from the selection 
ause of the random selection and assignment, 

y comparable at the start of the experiment. 
Thus we often say that the between-sample variability is the apparent treat- 
ment effect, by which we mean that it is the treatment effect plus some 
unknown bias. 


The error effects in the ex 
among subjects and differe 
influences Operating in the 
moment on th 
Differences a 


periment consist of preexperimental lane 
Nees arising from uncontrolled and unintende 


o which the experiment is liable. 


Let us review the equivalences of the terms and concepts introduced above 


The Completely Randomized Design 257 


to characterize this simple experiment involving two conditions and two 
random samples of subjects. 

The total sum of squares is a measure of the variability in all of the data. 
It contains differential treatment effects, differences due to inequalities among 
subjects, and other errors. The total sum of squares can be analyzed into two 
components. 

The between-sample sum of squares measures differences among subjects 
treated differently. It contains differential treatment effects and, possibly, a 
bias from errors. It is referred to as the apparent treatment effect. 

The within-sample sum of squares measures differences among subjects 
treated alike. It contains subject inequalities and errors of other kinds. From 
the within-sample sum of squares we obtain an estimate of the sizeof errors 


affecting the experiment. 


THE GUARANTEES OF RANDOM ERROR 


Two kinds of errors affect the comparisons in a psychological experiment. 


One kind consists of inequalities among subjects. These inequalities become 
errors, in the selecting and assigning of subjects with respect to the conditions 
of the experiment. The experimenter can insure that these errors are randomly 
distributed over the conditions by using a table of random numbers or some 
other tested device to make the selection and assignment. The other kind of 
errors consists of unintended differences produced by uncontrolled influences 
and associated with the particular occasion on which an observation Is 
obtained. The experimenter can insure that errors associated with occasions 
of Measurement are also random in their distribution over the conditions of 
the experiment. He can do so by arranging for the random conjunction of a 
Subject, a treatment, and the occasion on which the subject is measured under 


that treatment in the experiment. 


THE EVALUATION 


The random selection of subjects and the ran A 
and conditions to occasions of measurement can produce any one of many 


different combinations of an apparent treatment effect and an estimate of 
error, That is, random selection and assignment can produce many different 
combinations of values for the between-sample variability and the within- 
Sample variability. Just on appearances, the outcomes which are most favor- 
able for the announcement of positive results are those which suggest large 
treatment effects and small errors. Even though it must be admitted that these 
Most-favorable-appearing outcomes could actually be due to random error, 
if one must risk being misled, it is reasonable to take one’s chances on them 
rather than on the outcomes which suggest small treatment effects and large 
errors, 


dom assignment of subjects 
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i i tcomes 
The experimenter needs a systematic procedure for uei kien d 
which give the most favorable comparisons of "iW omi dapi iter 
i-statisti hich provide the desired com . 
rror. There are two i-statistics w e | pes 
iine employed in a test of significance provides the desired systematic p 
cedure. One is t as computed by formula (12.2), which is 


D, 
42V, [n i 
the 
This ratio compares the apparent treatment effect, as measured by 


1 i omputed 
difference between two sample means, with an estimate of error c p 


from the within-sample variability. The other i-statistic is F as computed by 
formula (12.4), which is 


t 


V! 

F= -ż. 

Vo r 

d by 

This variance ratio compares the apparent treatment effect, as menu $a 
the between-sample variability with the estimate of error, as given by 
combined within-sample variability. M ie 

In the test of significance, the result of the comparison is classified i 
larger or smaller than a predetermined criterion value. If the obtaine 


ie willi be 
falls in the class of error outcomes by which the experimenter is inr qe 
misled, he rejects the null hypothesis; if it does not, he accepts the 
hypothesis. 


With two conditions and two samples, ¢ by formula (12.2) and F m 
formula (12.4) give exactly the same result for the test of significance. En 
matter of fact, i? = F when c, the number of samples, is two. F has a 
en the number of conditions is greater od 
valuation is usually reported as an analyst 


à of 
re commonly represented in the form 
Table 15.2 


Table 15.2 


THE ANALYSIS OF VARIANCE 
ELY RANDOMIZED DESIGN 


GENERAL FORM OF 
FOR A COMPLET 


Component of Variability 


SS df V E s 
Between samples or conditions B c—1 V; VilVw 
Within samples or conditions Ww N—c V 

Total T N-1 


B = the between-sample sum of squares, 


W = the combined within-sample sum of squares, 
T = the total sum of Squares. 


NONADDITIVE EF FECTS 


"us r or, 
Constant and additive treatment effects do not affect the estimate of err 
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Which is com i 

ent citum wees — the combined within-sample sum of squares. Treat- 

Mos m n constant and additive may change the variabilit: 

and te apa Se T rem are treated acm d 
c ween MR itive, the result may be differences 

didus 1 s not solely due to error. There is 

bs ak dit iba oar involving two conditions and two sb 

iu ME involves the comparison of one sample variance with 

ratio, as given by formula (12.5), 


The test i 

Slaid p vie called a test of homogeneity of sample variances. When the 

Vases: Arias nin we often say that we have found heterogeneous 

student will f is also a test for the case of more than two samples. The 
nd the test described by Edwards (I), Ray (2), Walker and 


Lev (3), and others. ' 


AN EXAMPLE 


A y 

He ogis obtains two random samples of rats of a particular strain. 
to the ee a small, carefully measured amount of a common stimulant 
containing sd gns sample. He administers a carefully prepared placebo 
number of revol of the stimulant to the other animals. He then observes the 
hour Stier tr] olutions of an activity wheel each animal produces in the first 
tions. Tie has been fed to satiation. Table 15.3 gives the recorded-observa- 

psychologist proceeds to analyze the variability in the 20 observa- 


ti 
ons, as follows. 


ACTIV Table 15.3 
ITY MEASURES FROM THE STIMULANT EXPERIMENT 
Conditions 
Placebo Stimulant 
Sample 1 Sample 2 
7 4 
13 43 
42 30 
17 47 
9 23 
20 31 
20 32 
15 34 
14 15 
22 43 


N 
= 2: 
number of animals. 
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Summary data for the entire set of 20 measures in Table 15.3: 


ZY —481 
BY? = 14775 
N =20 
(EY? 4812 — — 
C, = F^ = 11,568.05. 


Computation of the total sum of squares: 


ZY? — C, = 14,775 — 11,568.05 = 3,206.95 
Summary data for Sample 1: 


ZY =179 
ZY? = 4,057 
n = 10 


Computation of the sum of squares for Sample 1: 


ZY% 792 
zrn = 4,057 — 1? 


——. = 852.9 
10 
Summary data for Sample 2: 
ZY = 302 
ZY? = 10,718 
n = 10 
Computation of the sum of squares for Sample 2: 
ZYE 2 
Ay: T — 10,718 m — 1,597.6 
n 10 


Computation of the combined within-sample sum of squares: 


ZY% 
Sfer = =] = 852.9 + 1,597.6 = 2,450.5 


Computation of the between-sample sum of squares: 


A(Xyyp 179? + 302? 
EE xh LI s — 11,568.05 — 756.45 
The fact that the total sum of squares should be equal to the nee 
sample sum of squares plus the combined within-sample sum of square 
provides a check on the computation. 
3,206.95 = 756.45 + 2,450.5 
The computation checks, 
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V, the variance for the apparent treatment effect, is computed by dividing 
the between-sample sum of squares by (c — 1) — 1 degree of freedom. 


Meus TS ane 
1 
Ví, the variance for the estimate of error, is computed by dividing the 
combined within-sample sum of squares by (N — c) — 18 degrees of freedom. 


iii s 2,450.5 
, 18 
The null hypothesis is that observed differences between conditions are due 


to error. Let the level of significance be 0.05. The test of significance involves 
computing F, the ratio of V; to V, or the ratio of the treatment variance to 


— 136.14 


the error variance. 


Vy " 756.45 — 5.56 


F = y: = 736.14 
F., the criterion value of F for one and 18 degrees of freedom and the 0.05 
level of significance, is 4.41. (See Table B; in the Appendix.) Since the 
Obtained F exceeds the criterion F, the psychologist rejects the null hypothesis 
and announces positive results. The results are summarized in conventional 


form in Table 15.4. 


Table 15.4 
ANALYSIS OF VARIANCE FOR THE STIMULANT EXPERIMENT”? 
Component SS df V à n" 
Apparent treatment effect 756.45 I 756.45 . 
Estimate of error 2,450.5 18 136.14 
Total 3,206.95 19 


‘Data in Table 15.3. is is rej 
Criterion value, F, is 4.41; null hypothesis is rejected. 


A test for nonadditivity of treatment effects should also be applied. The 


É ' 
test involving F requires the computation of a variance for each sample. ry 
the Variance for the first sample, is obtained by dividing the sample sum o 


Squares, 852.9 by (n — 1) — 9 degrees of freedom. 
p a 852.9 _ 94.77 
1 9 
V5. the variance for the second sample, is obtained by dividing 1,597.6 by 
das 0 degrees of freedom. 


UT Le = 17751 
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Since tables of F give ratios greater than one, we compute by dividing the 
larger sample variance by the smaller. 
NNUS NN 
F =p = 9477 = 187 

Since this procedure of putting the larger sample variance over the smaller 
will double the number of ratios greater than one, we shall need to compensate 
by choosing a level of significance which when doubled will give us the 
desired value. We choose 0.025. The effective level of significance is then 0.05. 
The null hypothesis is that the observed difference between variances is due 
to error. F, the criterion value for nine and nine degrees of freedom, is 4.03. 
(See Table B,.) The null hypothesis is accepted and the assumption of 
constant additive treatment effects is taken to be justified. 

Even if significance is obtained in the test of homogeneity of sample 
variances, there is no reason to abandon the main analysis. The consequences 
of nonadditivity are not serious. It has been shown that heterogeneity of 


sample variances has the effect of making the true level of significance slightly 
larger than the chosen level, 


The Matched-Group Design 


Imagine that we are to conduct an experi 
Ag, anda dependent variable, Y. If we cho 
Possible in their Tesponses, impose one condition on each subject, and 
measure on Y, the compariso 


made no attempt to match subj 


If We assign the members of each pair randomly, one to each condition: 
the residual errors, Tepresenting failures in 


y matching, will be distribute, 
randomly at the sam š subject combinations for ec 
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take place, then the errors associated with occasions of measurement will also 
be randomly distributed over the conditions of the experiment. 

Imagine that these randomization procedures have been carried out, that 
the conditions have been imposed, and that measures on the dependent 
variable have been obtained. The data can be tabulated in two columns, 
corresponding to conditions A, and A, and as many rows as there are pairs 
of subjects. The general plan for two conditions and r matched pairs of 
Subjects is shown in Table 15.5. Recall that the total sum of squares in a 


Table 15.5 
GENERAL PLAN FOR A MATCHED-PAIR DESIGN 


Conditions 
Ai As 
1 
2 
3 
Pairs . 
Jj 


double classification can be analyzed into three components: the between- 
Column sum of squares, the between-row sum of squares, and the residual 
Sum of squares, 

The magnitude of differences among pairs, relat 


Vill depend on the effectiveness of the matching proce i 
is nearly perfect, errors will be very small and most of the intersubject vari- 


ability will appear in the between-row component. If the matching is poor, 
errors will be large and much less of the intersubject variability will appear 
in the between-row component, on the average. Large or small, differences 
among pairs do not enter into the evaluation of the experiment. Since the 
experimental comparison can be made within each pair, only the failures in 
Matching must be taken account of in the evaluation. The extent to which a 
Blven pair differs from any other pair is of no concern to us. 

Differential, constant, additive treatment effects will enter the between- 
column component and will not affect the between- 
residual variability, Since between-row variability consis 
among pairs and between-column variability contains treat 
residual variability must provide an estimate of error. That c 


ive to the total variability, 
dure. If the matching 


row variability or the 
ts of differences 
ment effects, the 
onstant additive 
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treatment effects do not affect the residual variability can be deduced, as 
follows. . . 

Let Y;, be the measure on the dependent variable for the jt» matched pair 
and the K'^ treatment. Now Y;, can be conceptualized as consisting of two 
parts: the level of response which would have been obtained without treat- 
ment and the increment (or decrement) of response produced by treatment. 


If we let Lj, be the level of response without treatment and a, be the 
increment or decrement produced by treatment, then 


Vin = Lik + ay. 
This equality statement should not be interpreted as meaning that E 
experimenter ever knows the values of Lj, and a,. If he actually knew the 
value of a,, there would be no point in doing an experiment. We have iced 
duced these symbols for deductive purposes only. They do not have to be 


given numerical values to serve our purposes. Notice that a, varies only 


from column to column, but that it is a constant from row to row, within any 
column. 


The mean of the jth row can be written 


Y; = D; + a, 
where d, is the average of the c values of a. 
means with one another, a process implicit in the computation of the between- 
Tow sum of squares, à, will cancel. For example, the difference between the 
mean of the third row and the mean of the fifth row can be written 


Ys — Yo Iy = Dn 


Thus the between-row variability will be a measure of the differences between 
pairs and will be unaffected by the treatments. 


The mean of the kth column can be written 


x in any row. In comparing TOW 


Y, = D + ap 


In comparing column means, a process implicit in the computation of the 


ares, values of a, will not cancel. For ion 
mean of the first column and the mean of t 
second column will be 


Macs y ema ai) — (La + 43). 
Since the two treatment effects, a, and 4», may differ, the between-column 
variability will contain this differential effect. 
The residual deviation for the j row and the kth column is 


Imo Ya = Yj — Tx +M, 


M ,, the total mean, is equal to the average row mean or the average colum? 
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mean. In either case, 
M,—M a. 
We have said that ! dis 
Ys La * 
and have shown that ü PM 


and 
" Y; = L, + ay. 
ubstituting for Y;,, Y;, Y.» and M, in the formula for r;; yields 
rjj = (Lin + a4) — (Ly. +8) — (La + ag) + (Mi +å). 


Removi 

o ini i 

aH oe parentheses and combining terms gives an expression for rj, in 
Ich a, and d. do not appear: 


fj = Lg — Ly — La + Mie 


vule that the residual variability does not contain the constant, 
E effects of treatments, that it reflects failures in matching which have 
e randomly distributed over conditions, and that it provides an estimate 
error, 

Pio iode in the data for a matched-group design can be analyzed into 
colima mponents: the apparent treatment effect as measured by the between- 
e a of squares, the differences among the matched groups as 

by the between-row sum of squares, and the error variability as 


me d E : 
asured by the residual sum of squares. The investigator evaluates the 


experi f ` N : 
Periment by comparing the treatment variance to the error variance in a 


of significance, He chooses a level of significance. The treatment variance, 
eon ee by dividing the between-column sum of squares by (c — 1) 
residual of freedom. The error variance, V^, is computed by dividing the 
involves cnn of squares by(r — Ile — 1) degrees of freedom. The comparison 
computation of an F ratio, 
aes 
F= y: 
hid investigator compares the obtained F with F,, the criterion value, and 
m or rejects the null hypothesis. 
anap $ general form of the conventional prese 
ysis of variance, applied to a matched-group 


ntation of the results of an 
design, is given in Table 15.6. 


TRE MATCHING PROCEDURE 
e measurement of all subjects prior to the 
X which is known to 


f subjects according 


T . : 
ie first step in matching is th 
: Hn and under uniform con 

orrelated with Y. The second step is the ranking o 


ditions on à variable 
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Table 15.6 


THE ANALYSIS OF VARIANCE FOR 
A MATCHED-GROUP DESIGN 


Component SS df V B 
Treatments C &—1 Vi VilVe 
Groups G r—1 
Error R (r — Dc — 1) V 

Total T N-1 


between-column sum of squares. 
between-row sum of squares. 
residual sum of squares. 

total sum of squares. 


to the magnitude of their measures on X. The third step is the pairing. The 
first and second subjects in the ranking are put in one pair; the third and 
fourth subjects are put in a second pair; the fifth and sixth are put ina third 
pair; and so on. Matching in this fashion on X indirectly matches the 
members of each pair on Y. The effectiveness of the matching depends, of 
course, on the correlation between X and Y. ` 

The members of each matched group are assigned randomly to the 
conditions of the experiment. Each subject-treatment combination is then 
randomly assigned to an occasion of measurement. 

When the number of conditions is greater than two, the matched groups 
must be correspondingly larger. Thus for an experiment involving three con” 
ditions, each matched group would consist of three subjects. Subjects woul 
be ranked on the preexperimental variable, X. The first three subjects in the 


ranking would then be chosen for the first group; the next three subjects 
would be placed in the second group; and so on. 


AN EXAMPLE 


A psychologist wishes to determine the effect of paying subjects for their 
participation in an experiment. He chooses a measure of visual acuity aS the 
dependent variable. He obtains a sample of 20 male, freshman college 
students. He measures all of the student subjects on the test of visual aculty 
under standard conditions prior to the experiment. He next ranks the ? 
subjects on this preliminary measure and then forms 10 matched pairs. bn 
members of each pair are then assigned randomly to the two conditions: Ap 
no payment for participation in the experiment: Aa payment of one dollar 
per hour for participation, Subjects in condition A, are informed they will b 
paid. Subjects in A, are not told they will be paid. Measures of visual acuity 
are obtained on all subjects. To insure the maintenance of good public 
relations, subjects in A, are paid, after the experiment is over. 


The data for the visual acuity experiment are given in Table 15.7. Measures 
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are expressed in milli i i 

oo. millimeters. The analysis and evaluation of the data are 
Table 15.7 


THE MATCHED-PAIR EXPERIMENT 
ON VISUAL ACUITY 


Conditions 
Ay As 


Pairs of 


Subjects 


X 8E 2 a XX & Xu dd 


5 


Samary data: 
Cor kirki number of rows; c, the num 
"rs er of observations. 
any saliin the observation in any C 
Original n; ZY, the sum of all values; 7 
values. 


ber of columns; N = (rc), the 


ell; R, the sum of any row, C, the sum of 
y2, the sum of squares of all. 


r=10 c=2 N — 20 
ZY = 284 Xy? = 4,166 
ZR = 8,274 ZC? = 40,378 

284° 


= = 8 
" C, 20 4,032. 
omputation of the total sum of squares: 
Xy: — C, = 4166 — 4,032.8 — 1332 


Co; 
m, ; . 
iputation of the between-column sum of squares: 


f cesses 


AE 40,378 _ 4932.8 = 5.0 
r 10 
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Computation of the between-row sum of squares: 


SES e S274 _ 40328 = 104.2 
= 


Computation of the residual sum of squares: hen 
Let rj, be the residual deviation for the jt row and the kt" column 

Nin = Yu — Y, — Y, 4+ M, 

rı = 19 — 18.5 — 13.7 + 14.2 = +1 

ro, = 14 — 16.5 — 13.7 + 14.2 = —2 


Fa = 0 Fog = +2 
ra = +l ra = 0 
ra = 0 ra — —l 
fgg uec "s = 0 
mm ——l oe = +1 
ra = 0 ræ = +1 
rg) = +2 rg, = 0 
Tor O0 To, = —2 
Fu ——l ros = 0 
Ar = 24 


TT f the 
The fact that the total sum of Squares should equal the combination sen 
three components provides a check on the computation. We observe 


133.2 = 5 + 104.2 + 24, 
The computation checks, 
Table 15.8 shows the su 
the F ratio for the test of 0.0 
between conditions are due to random error. The criterion value for the O- 
level of significance and o 


in the Appendix.) The o 


btained value of F, 1.87, is not significant. The ape 
hypothesis is accepted. The researcher does not announce positive m he 
may decide to abandon this line of research or to try to improve 
experiment and repeat it, 
Table 15.8 
ANALYSIS OF VARIANCE FOR THE MATCHED-PAIR DESIGN’ 
Component SS df V F 
Treatments 5.0 1 5.0 1.87 
Groups 104.2 9 
Error 24 9 2.67 
Total 133.2 19 


“Data in Table 15.7. 


^F. the criterion value for the 0,05 


f 
2 egrecs O 
5 level of significance and one and nine deg! 
freedom, is 5.12. The null hypothesis is accepted. 


269 
A Design with a Correlational Adjustment 


a ona dependent variable, Y, for two random samples 
eet ee conditions have been imposed, we can present the data 
Sed n Table 13.1. This table was used earlier in the discussion of the 
mp etely randomized design. We can use the table again as the starting 
pon for our discussion of a design with a correlational adjustment. 
" oe that differences among the observations in Table 15.1 may be 
"obs to differences which existed among the subjects prior to the 
the ei and wuch persisted through the experiment. We could adjust 
A ations on Y, for the purpose of eliminating these initial differences, 
We ir E Mia measures obtained on the subjects prior to the experiment. 
for x then eliminate that part of the variability in Y which was accounted 
wal. d ariability existing prior to the experiment as measured by X, a 
realmi correlated with Y. By doing so, the size of the bias in the apparent 
bias d em would be reduced, on the average, and the estimate of that 
Eu even also be reduced a corresponding amount, on the average. The 
Pew ould be an experiment subject to errors which were smaller than those 
* completely randomized design. 
"anis e a correlational adjustment, ; 
subject ry. Obtain two random samples of subjects 
l'a Son x under uniform conditions. Impose t i 
» One condition on each sample. Measure the subj 


the following operations are 
from a supply. Measure all 
he two conditions, A, and 
ects on Y, the dependent 


aS 
ariable. Table 15.9 gives the general form in which the measures can be 
arranged, 
Table 15.9 
GENERAL FORM OF THE DESIGN INVOLVING 
A CORRELATIONAL ADJUSTMENT 
Conditions 
Ai As 
Sample 1 Sample 2 
Ss = Subjects. 
ples of 


Cons; 

~ ~NSide i 

Paireg lesan new way of analyzing the variability in the two sam 
u : 
Tes represented in Table 15.9. 
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Let r, be the product-moment correlation between X and Y over the entire 
table, disregarding the classification by samples. Then r? is the proportion of 
Y variability correlated with X, and (1 — rẹ?) is the proportion not correlated 
with X. 

Let Sy? be the total variability in Y as given by the sum of squares of 
deviations from M,, the total mean. The total variability includes treatment 
effects, differences among subjects, and other errors. 

Then r? Zy? represents the predicted variability, a sum of squares reflecting 
differences among subjects to the extent that these differences can be pre- 
dicted from X. Furthermore, (1 — r?) Zy? represents the unpredicted variabi- 
lity, a sum of squares containing treatment effects and errors other than the 
predictable differences among subjects. It will be convenient to represent this 
unpredicted sum of squares by the expression, (T + Ej). T stands for treat- 
ment effects; E, stands for error or bias in the unpredicted sum of squares 
when it is computed in this particular way. This expression is not meant to 
imply that values can be determined for T and E, separately. If they could be, 
evaluation of the experiment would be very simple. The expression, (T + £)» 
stands for a single quantity which, we can deduce, contains the differential 
effect of treatment and the effect of errors in unknown amounts. The student 
may wonder how we can deduce the fact that treatment effects will be in the 
unpredicted sum of squares rather than in the predicted sum of squares. 
Assigning subjects to conditions randomly insures that X is random with 
respect to the effects of the conditions. Therefore, the differential effects of the 
conditions will not be correlated with X and will not be predicted from X, 0^ 
the average over many experiments. 

We enter upon a new phase in the analysis by defining another product- 
moment correlation coefficient which can be computed from the data. Let rw 
be the combined within-sample correlation between X and Y. That is, 


"= [Ele Ss Dye 

To compute the numerator, we find the sum of products of deviations for 
each sample, separately, and then combine these quantities. To compute e 
double sum ion. the left in the denominator, we find the sum of squares ° 
deviations in X for each sample and then combine those sums. The double 
sum on the right in the denominator is computed in the same way. What We 
are doing is computing the correlation between X and Y after the between" 
sample variability has been eliminated. In other words, we are computing the 
correlation between X and Y using only the withinzsample variability for 
each variable. 


Then r$, is the proportion of within-sample Y variability correlated with 
the within-sample X variability, and (1 — r3) is the proportion not correlate 
with X. 
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B esie pis is the combined within-sample variability in Y. It includes 
For cesi ong Laien treated alike and other errors, but does not 
AE d me eae they are constant and additive. 
geen eae hat rod Ey is the within-sample predicted variability, a sum of 
differences suung differences among subjects, to the extent that these 
mean i. be predicted from X. Furthermore, (1 — rz) ZZy? is the 
Bis ny e unpredieted variability, the sum of squares which provides an 
PO EE error. Let E, be this error sum of squares. It can be computed 
B s o estimate E,, the error component or bias in (T + Ej. 
y the subtraction, 
(T + E) — Ew 

call Tpiasea, the apparent treatment 
ch measures the treatment effect as it 
herefore call it the apparent treatment 


se A a quantity which we shall 

ie 3 biased IS à sum of squares whi 

a oA been biased by error and we t 
. We can evaluate Tpiasea in à test of significance. 

The analysis and evaluation are summarized in Table 15.10. This kind of 


Table 15.10 
GENERAL FORM OF THE ANALYSIS OF COVARIANCE 
Component SS df V F 
Treatment effect — Tuaset ¢-i i VilVe 
Error E. N—-e-! Ve 


a = sum of squares for apparent treatment effect. 


= sum of squares for estimating error. 


r the dependent variable are adjusted 


a "n ; 
nalysis, in which the original data fo 
ubjects as measured by 


wit h ^ 
d = Tespect to preexperimental differences among $ 

n. ele X, is often referred to as an analysis of covariance. 
is appropriate to test the sample variances on Y for evidence of non- 


additivi 1 š 
or CUM of treatment effects. It is also appropriate to test the homogeneity 
aen pe correlation coefficients as providing further evidence on the non- 

itivity of treatment effects. These tests for nonadditivity will be illustrated 


in the example which follows. 
Fond aai iin of the covariance design in red 
e $ on the magnitude of the correlation between 
the ons is zero, there is no point in doing an analysis 
mee orrelation is zero, one discards the preexperimental measures and does 
ma mple analysis of variance on the Y measures. One's decision about the 
ml m of the correlation is based on a test of significance of the within- 
the ple predicted variability. A predicted variance, V, can be computed from 

e within-sample predicted sum of squares, rz E Ey by dividing it by one 
we of freedom. F is the ratio of Vj to Vi, the error variance from the 
in analysis of Table 15.10. 


ucing the size of errors 
n X and Y. If this cor- 
of covariance. When 
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AN EXAMPLE 
Table 15.11 contains experimental data which will be analyzed and 
evaluated by the analysis of covariance. The data were obtained from an 


experiment in which the psychologist wanted to determine the effect 2 
practice on scores for a mechanical assembly test. The experimenter obtaine 


Table 15.11 
DATA FOR THE ASSEMBLY-TEST EXPERIMENT 


Conditions 
As As 


Sample | Sample 2 


PWN 
E 
RWN 


e 


x 
4 
3 
4 
3 
5 
6 
5 
3 
3 
3 


O00 -10t 
UH Cà n ROS S UA L9 
O0 -10 t 
aA dA dA de UD 02 O3 CA CA OX 


Ss — subjects. 
X = preexperimental test score. 
Y = dependent variable test score. 


two samples of subjects and measured all subjects on a different assembly test 
under uniform conditions prior to the experiment. He then imposed the 
practice conditions, which were one hour for the first sample and two hours 
for the second sample. Subjects practiced on the initial assembly test. He then 
administered the assembly test which yielded the scores on the dependent 
variable. A score on either test indicated the number of assemblies completed 
in a five-minute period. 


Summary data for the entire set of measures: 


ZX =79 ZY = 148 
ZX? = 329 ZY? = 1,334 
ZXY = 617 
N = 20 
Computation of the total sums of squares and products: 
AX 2 jo 
2H?) == BX — =a = 329 -E — 16.95 


20 
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: . Qr 2 
zy = Ey: e 1,334 m = 238.8 


KEY 79)(148 
Exp = EXY (79148) — 
N 617 20 32.4 


uares, Sy, to obtain (T + Ej), the 


The adjustment of the total sum of sq 
defined by the expression 


sum of squares of errors of prediction, is 
3 1 —)ày* 
Which can be written as i ES 
Eam n LES cass us 
may ay E 238.8 — 1695 176.867. 


rrors of prediction based on the total s 


ums 


Ti E,), the sum of squares of e 
quares and products, is 176.867. 


Summary data for Sample 1: 


EX = 39 ZY =59 
ZX? = 163 ZY? = 459 
SEXY = 252 
n= 10 
Computation of sums of squares and products for Sample 1: 
(EX? _ 163 — 7h = 109 


AE = 2S 
n 


H 59* 
zy = ZY? NOM = 459 -0 = 110.9 
n 


5 39)(59) 
Zxy = ZXY ZXIY 0s —‘ X 319 
n 

Summary data for Sample 2: 
Ex =40 Ey =8 
SX? = 166 Apte 
yyy = 365 
n=10 


Computation of sums of squares and products for Sample 2: 
40° 


IXY 
zx -sx —& »- 166 — 16 = 


89? 
z 2zr- E dii gg rm 
a (4089) . 
yam mean 10 =9 
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Combining of sums of squares and products for the two samples: 


= 10.9 + 6 = 16.9 
110.9 + 82.9 = 193.8 
21.9 +9 = 30.9 


The adjustment of the within-sample sum of squares, 22y?, to obtain Ew 
the sum of squares of errors of prediction based on the within-sample sums 
of squares and products, is defined by the expression, 


(1 = r2) 2 Zy?, 


which can be written as 


& g 2 (22xyyP 
ZXy — ry = Zp — Exe 
E,, is computed, as follows: 
30.9? 
193.8 — i69 ^ 137.302 


Thus E,, the sum of squares of errors of prediction based on the within- 
sample sums of squares and products, is 137.302. 


Tpiasea, the sum of Squares for the apparent treatment effect, is the 
difference, 


(T + E) — E, = 176.867 — 137.302 = 39.565. 


Table 15.12 gives Toiasea, Ew, 


degrees of freedom, variances computed from 
the two sums of squares, and t 


he F ratio comparing the apparent treatment 


Table 15.12 
ANALYSIS OF COVARIANCE FOR THE ASSEMBLY-TEST EXPERIMENT" 
Component SS df Vv F 
Treatment 39.56 1 39.56 4.90? 
Error 137.30 17 8.08 


“Data in Table 15.11. 
vF, for the 0.05 level of 


the 0.( significance and one and 17 degrees of freedom is 4.45; the null 
hypothesis is rejected. 


effect with the estimate of error. The null hypothesis is that the apparent 
treatment effect is due to error. F, for the 0.05 level of significance and one 
and 17 degrees of freedom is 4.45, 


(See Table B, in the Appendix.) Since the 
obtained value of F, 4.90, exceeds Fo, the null hypothesis is rejected and the 
investigator announces Positive results. 

The main evaluation of the assembly-test experiment has been described 
above. The test of significance of the correlation between the dependent 
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variable, Y, and the adjusting variable, Y, will be described next. This order 
of events is followed here because of its convenience in explaining the 
analysis. In actual practice, the test of significance of the correlation between 
X and Y would precede the main evaluation. If the correlation proved to be 
nonsignificant, an analysis of covariance would not be used for the main 
evaluation. 

The correlation to be tested is r,, which is based on the within-sample 
sums of squares and products. A convenient way of performing the test is to 
compute the within-sample predicted sum of squares, 


re Ddy, 
which can be written 
(E Xxyy 
Dax? 
and computed, as follows: - 
30.9? 
m = 56.50. 


The predicted variance, V7, is computed by dividing the predicted sum of 
Squares by one degree of freedom. Thus 


of F, the ratio of the 


TI igni ires the computation I 
Ne test of significance requires vite bs. Thali, 


H t Li 
Predicted variance, V/, to the error variance, Vi, fro 


RO 1053 
vi 8.08 

The null hypothesis is that the observed correlation between X wi Yis x 
to error, F, for the 0.05 level of significance and one and 17 ond 
freedom is 4.45. (See Table B, in the Appendix.) Since the n by 
Exceeds the criterion value, the null hypothesis is rejected. Ee n d. im 
Significant correlation between XY and Y justifies the use o Ja 
Covariance in the main evaluation, which has already been cede —_— 

There are two tests for nonadditivity of treatment effects. The cto se 
the Comparison of one sample variance with the other. ies gr 
Sample variances is carried out only for the dependent varia dedos 
treatments could not have had any effect on X. The variance 
Sample is 


110.9 _ 
y, =p = 1232. 


The variance for the second sample is 


82.9 
Vi—79:— 9.21. 
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The ratio of the larger variance to the smaller is 


y, 1232. 
= y; = ogy = L4 


Since we have compared the larger with the smaller, we must choose a value 
of F, which is exceeded by 0.025 of values in the sampling distribution to 
achieve an effective level of significance of 0.05. The null hypothesis is that 
the observed difference between the two sample variances is due to error. 
The criterion value, F,, for nine and nine degrees of freedom is 4.03. (See 
Table B, in the Appendix.) Since F is less than F,, the null hypothesis 15 
accepted. 

The second test for nonadditivity is a test of the homogeneity of the sample 


correlations. We compute £,, the sum of squares of errors of prediction for 
Sample |, as follows: 


" (Exyf 21.9? 
Ey = 2j 3 = 110.9 — 109 ^ 66.90. 
For Sample 2, the sum of squares of errors of prediction is 
2o». Qo? &— 9? 
E, = g — met 82.9 — ge 69.4. 


E, the combined sum of squares of errors of prediction from the 
individual samples, is computed, as follows: 


E, = E, + E, = 66.90 + 69.4 = 136.30. 
The difference, 


E, — E, = 137.30 — 136.30 = 1.00, 


is a measure of the heterogeneity of the sample correlations. It can be 
evaluated by comparing its variance, V},, computed with (c — 1) = 1 degree 
of freedom, with V^, a variance computed from E; with N — 2c = 20 — 4 = 
16 degrees of freedom. Thus 


yi —. E _ 136.30 


N= 7 36 ^ 8.52, 
and 


The null hypothesis is that the observed difference between sample correla- 
tions is due to error. F, for the 0.05 level of significance and one and 16 
degrees of freedom is 4.49. (See Table B, in the Appendix.) The null hypothesis 
is accepted. Since sample variances and sample correlations were not foun 


to differ significantly, the assumption of constant and additive treatment 
effects is taken to be justified. 
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Summary 


Three designs for simple experiments have been presented in the present 
chapter. The first design required that subjects be selected randomly, and that 
subjects and treatments be assigned randomly to occasions of measurement. 
There is no provision in this first design for reducing the size of errors to 
which the experiment is liable. Consequently, when subjects are heterogeneous 
and uncontrolled influences have large effects, this completely randomized 
design may yield data containing large biases. The second design required 
that subjects be measured, under uniform conditions prior to the experiment, 
On a variable related to the dependent variable. Subjects are ranked and 
formed into matched groups. Subjects within each matched group are then 
assigned randomly to conditions and to occasions of measurement. To the 
extent that the matching variable and the dependent variable are related, the 
Matching will be successful and the errors will be small. The third design 
required the random selection of subjects and random assignment to con- 
ditions and occasions of measurement, with measurement prior to the experi- 
ment on a variable known to be correlated with the dependent variable. After 
the conditions are imposed on the random groups and after measures on the 
dependent variable have been obtained, the resulting data are adjusted by 
Computational procedures which take advantage of the correlation between 
the two variables. . 

An example was given for each design. The examples involved only two 
Conditions, The designs can, of course, be enlarged to incorporate any number 
9f conditions. For examples in which the number of conditions 1S greater than 
two, the student might wish to consult a text on experimental design by 


Ray (2) or others. 


In this discussi «peri 1 rch, on 
s discussion of experimental research, 

i imati r is 

Considered. No attention was given to the estimation of parameters. It 


Possible to compute, from experimental data, estimates of means, — 
and differences between means, but the procedures of estimation are se | 
Used in psychological experimentation. One reason is the — 4 a 
the units in many psychological scales. Another reason !5 iet 
Psychological experimentation is exploratory and detecting the C1 


- i i t. 
effect of two or more treatments is itself quite an accomplishmen 


ly tests of significance were 


TIENE chological Research (New York: 


ts Edwards, A La Experimental Design in Psy 

Rinehart, 1950). ] ; in 
2. Ray, W, S., An Introduction t0 Experimental Design (New York: Macmillan, 
1960). 


a Walker, H, M., and Lev, Joseph, $t 


atistical Inference (New York: Holt, 1953). 


INFERENCE IN EMPIRICAL RESEARCH 
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In what we choose to call empirical research, the psychologist observes, 
establishes relations among variables, and makes predictions based on those 
relations, He does not deliberately interfere with or change natural processes 
for the purpose of determining the consequences of such action. Experi- 
mental research is, of course, a more ambitious undertaking than empirical 
research. Experimental knowledge bestows the power to control natural 
events. Because men in practical affairs place high value on experimental 
knowledge, empirical studies are often valued less highly than experiments. 
Nevertheless, empirical research can produce knowledge which makes it 
possible to predict natural events. It is often the only kind of research which 
the psychologist is permitted to undertake by society. In many areas of human 
behavior, experimentation is out of the question. 

Statistical inference is employed in the evaluation of empirical research 
data in both of its major forms: testing significance and estimating para- 
meters. It is not possible to present all of the specific applications of sig- 
nificance-testing and estimation in empirical research. We have chosen only 


a few of the more important tests and estimation procedures for discussion 
here. 


Tests of Significance 


Three examples of tests of significance as they might be applied to data 
from empirical investigations are described below. 

1. A psychologist wishes to know whether the student bodies of two large 
universities differ in verbal fluency. It is not practical to test all of the students 
so he obtains a random sample of 100 students from each university and 
administers the test to students in both samples under standard conditions. 
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He cor 

Pera the mean of each sample, the difference between means, the 

E oo sum of squares, and the combined within-sample 
. He then computes ¢ by formula (12.2), 

_ Da 

J2V n 

n sample means, V, is the combined 


f subjects per sample. Vj is 
ample sum of squares by 


t= 


Seale D, is the difference betwee 
rare’ ee variance, and n is the number o 
Qn — E kia dividing the combined within-s 
dibisener E of freedom. The null hypothesis is that any observed 
ifie. n gs means is due to random error. He sets the level of sig- 
Table A in a oe then refers to a table of to classify his result. (See 
emer ih ppendix.) If his obtained value of t exceeds ta positive, or 
positiver y to negative, he rejects the null hypothesis and announces 

results. If it neither exceeds nor 1S exceeded, he accepts the null 


Ev and acknowledges negative results. 
oo Ed aire in the first example may 
the diffe variances. Doing so would serve two purposes. First, the test of 
samples € between means provided by formula (12.2), applied to two 
variances IE) different populations, requires that the populations have equal 
he will oe he compares the two sample variances in a test of significance, 
actually b ain evidence bearing on this requirement. In addition, he may 
Not ail ae in comparing the dispersions of the two student bodies. 
neris b the information one might desire in comparing two groups is 
sarily contained in a comparison of means. To compare sample 


Varia 
riances, he computes F by formula (12.5), 


also decide to compare the 


F= y à 
sum of squares by (n — D degrees of 


way. The null hypothesis is that any 
ces is due to error. Since 


ve " 

fete sob by dividing a sample 

Fide V; is computed in the same 

tables ed difference between the two sample variances 1$ ] 
of F usually give only ratios greater than one, it 1$ à general practice 


to € à 
place the larger variance in the numerator and the smaller in the denomi- 
he number of values of F greater than F, for 


nat i 
Or. The effect is to double tl d achi level of si 
achieve a level of sig- 


a oF 
i level of significance. To compensate an 
“ance of 0.05, the psychologist chooses à value of Fe exceeded by 0.025 
ee Table By in the Appendix.) 


bution. (S 


with F,, and accepts Or rejects the null 


o 
dei values in the sampling distri 
hy nen compares the computed F 
Pothesis, 
"bei educator has decided t 
obtain, and grades in the junior hig 
S a random sample of junio 


gate the relation between verbal 
metropolitan area. He 
nts and tests them on 


o investi 
h schools of a large 
r high-school studer 
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verbal ability. He also obtains their average grades for the preceding term. 
He computes the product-moment correlation coefficient for the sample and 
from that value computes t by formula (12.3), 


iz FN =2. 
Jl — rà, 


In the formula, r, is the sample correlation; N is the number of subjects in 
the sample. The null hypothesis is that any observed correlation is due to 
random error. He determines the criterion value of t for the 0.05 level of 


significance and (N — 2) degrees of freedom, and accepts or rejects the null 
hypothesis. (See Table A in the Appendix.) 


Estimating Parameters 


Within the context of the present discussion, an i-statistic is a value 
computed from one or more random samples. A corresponding value com- 
puted from the population is called a parameter. When a population is too 
large to be measured and described, that is, when it is too large to permit the 
computation of parameters, it is often of interest to the researcher to obtain 
an estimate of one or more of the parameters. In general terms, an attempt 
is made to infer values of parameters from sample values, 


Estimates are divided into two large classes: point estimates and interval 
estimates. They will be discussed in that order. 


POINT ESTIMATES 


A point estimate is a single value computed from one or more samples. 
It possesses certain properties which qualify it as an estimate of the corre- 
sponding parameter. Although several different kinds of point estimates can 


be distinguished, we shall consider only point estimates which are unbiased 
or approximately so, 


Consider any i-statistic, the corr 
distribution of the i- 


Four examples of unbiased estimates are given below. 


1, M, the Sample mean, is an unbiased estimate of p, the population 


mean. That is, M,, the average of all possible sample means, is equal to the 
population mean, p. 
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S. IE iffere 
‘ia ME between the means of two samples from two popula 
| IS an unbiase i B e di i 
Me estimate of the difference between the two population 
That A pa — Be 
i D, = pı — Be 
n other i 
nins Was, the average of the totality of differences between sample 
: P Rien to the difference between the two population means. 
^, the sample variance computed with (n — 1) degrees of freedom is 


an unbi sti f i 
ased estimate of the population variance, c?. That is, 


Vi —g? 
9r, in wor : 
omaa A iy average of all possible sample variances, when each one is 
$ with (n — eor : ; 
variance, ( 1) degrees of freedom, is equal to the population 


The : 
TEES e ry may be interested in knowing that V,, the sample variance 
ayeta = id n, is a biased estimate of the population variance, since the 
Sine, of all such sample values is less than the population variance. In 
, 


V. <o 

t can be shown that Í 
RE LL yra 
iae c RR 


t-moment correlation coefficient is 


ment correlation coeflicient, is an 
f the sampling distribu- 


4 Ww 
Zero oe Pr» the population produc 
Wibinsed e New the sample product-mo 
tion of estimate of the parameter, since the mean o 
"zy, is zero, Symbolically, 
0. 


Fry = Pry 3 


Zero, sy 


is a biased estimate. An 
when it is not zero, is given by 


Whe : 
abita the population correlation is not 
Ximately unbiased estimate of pew 
day), 
Fey + fev 2(N — 1) 


I 
NTERVAL ESTIMATES 


sie pila an interval estimate of a given 
an inter ore samples, two values which are called 
Points a which is known to vary with sampling 4 
ee the interval have a sampling distribution. The interval has the 
Sometj y that it will sometimes encompass OF bracket the parameter and will 
imes fail to do so. The size of the interval can be established by the 


parameter, one computes, from 
lled /imits. These limits establish 
g. That is to say, the end- 
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researcher so that, in the totality of such intervals, some known and stated 
proportion of them will fail to bracket the parameter. By his choice o! the 
size of the interval, the researcher can make the proportion of failures as 
small as he desires. In any given attempt, of course, he has no way of knowing 
whether the interval has succeeded or failed. He anticipates that, in a large 
number of such attempts, the interval will fail to bracket the parameter with 
the relative frequency he has intended. . ] 

The logic of the procedure whereby one arrives at an interval estimate is 
illustrated below with an example involving the estimation of a population 
mean from the mean of a sample. As we indicated earlier in the discussion 
of formula (12.1), the i-statistic, 


M: — pb 


JV ijn 
is distributed as t. It will be convenient to let S, = JViln. 
Then 
LM.—y 
S, 


It follows that 
IS, = M, — p. 

The quantity, (M, 
Notice that z is neg 
We shall choose 


— #), can be thought of as the distance of M, from p. 
ative when M, < p and positive when M, > p. 

à particular value of t and designate it as 1,, where the 
subscript refers to the proportion, P. The value of t» is chosen so that P/2 
of the values of ¢ in the sampling distribution are less than 1, when it 1s 
negative, and P/2 of the values of t in the sampling distribution are greater 
than t, when it is positive. 

Consider now the relation between tS. and 1,S,. In the sampling distribu- 
tion of tS., P/2 of the values will be less than t,S, when it is given a negative 
sign, and P/2 of the values will be greater than 1,S,, when it is given a positive 
sign. That this relation holds can be verified by observing that 


tS, <1,5, 
whenever 
E 
and that 
iS, > 1,9, 
whenever 
PSs hy 


Recall that 1S, = M,— p = 
the distance of a sample mean fr 
and that it relates to tS 


of (Ma — p) in the sai 


the distance of M, from p. We conclude that 
om the population mean varies with sampling 
e Just as does 1S.. This is to say that P/2 of the values 
mpling distribution of such distances will be less than 
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Pr when it is taken to be negative, and P/2 of the values will be greater 
m pin when it is taken to be positive. We can take account of both classes 
ks ues by saying that the absolute value of (M. — p) exceeds the absolute 
value of 1,5, in P of the totality of sample values. 
wee let M, represent any point on the scale of values for sample means, 
e can construct an interval on this scale around the point, M;. The limits 
of the interval will be 
M; — bS. 
and : AT 
ME fS 
rs interval will have the following characteristics. When M, < p, the 
ONE will fail to bracket or encompass p for any sample whose mean is a 
: istance from the population mean greater than 1,S,, a circumstance which 
jx true for P/2 of samples. When M, > p, the interval will again fail to 
racket 1. for any sample whose mean is more distant from the population 
mean than /,S,, a characteristic of P/2 samples. Thus the interval, 


M, + tySe 
amples. The interval is called an interval 


estimate of the population mean, p. The value, Mz — tpSes is called the 
lower limit of the estimate. The value, M. + tS», is called the upper limit, 
Considered together, the two values are called confidence limits. The value 
ofp is called the /evel of confidence. 

Figure 16.1 shows graphically how interval estimate 


will fail to bracket p in P of all s 


s of the population 


Sample 1 


Bas Se SSeS 
M; U 


L p 
sss Eee 
S, L M; U p 
Sample 4 
i — — o c: 
Lu M. U 
"MENT E LL er aC 
L f a U 


computed from five random 
L is the lower limit of the 


opulation mean, p, 
s and the widths of 


sant 16.1 Interval estimates of the po à 
inne’ from the same population. Mz i$ the sample mean; q 
terval; U is the upper limit of the interval. Notice how the location 
€ intervals vary from one sample to another. 
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mean can vary from one sample to another. In the figure, yz is the population 
mean, M, is the sample mean, L is the lower limit of the interval, U is the 
upper limit. Notice that the location of the interval and the width of the 
interval vary. The population mean does not vary. In one of the five examples 
of Figure 16.1, the estimate fails to bracket the population mean. 


AN EXAMPLE 


An instructor in physical education wishes to determine the average pulse 
rate for college freshmen who have been enrolled in special gymnastics 
classes for a period of eight weeks. He obtains a sample of 50 students from 
a total enrollment of 2,300. The mean of the 50 determinations of pulse rate 
is 80. The variance, Vj, is 4.5. S,, which is equal to J/V//n, is 0.30. For 49 
degrees of freedom and the 0.05 level of confidence, t, = 2.01, a value 


obtained from Table A by linear interpolation. The lower limit for the interval 
estimate is 


M, — t,S, = 80 — (2.01.30) = 79.397. 
The upper limit is 


M; + t,S, = 80 + (2.01)(.30) = 80.603. 


Thus the estimate is the interval from 79.397 to 80.603. The instructor knows 
that 5 per cent of the totality of such estimates fail to bracket the parameter. 


INTERVAL ESTIMATE OF THE VARIANCE 


The problem isto establish an interval which fails to bracket the population 
variance, o?, for a proportion, P, of the totality of samples of size n. 
It can be shown that the ratio, 


(n — Dv: 


g? 


is distributed as a chi square function identified by (n — 1) degrees of freedom. 


(See Table C, in the Appendix.) The symbol for “ chi square” is X^ It 
follows that 


x: V. 
m= ww 
Let us specify a value of chi square, x?, such that P/2 of values of chi square 
in the sampling distribution are less than the specified value. Then it follows 
that 
XT x 
(n — 1) s (n — 1) 
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for P/2 i i istributi 
v Le ini in the sampling distribution. Since the left member of this 
quality is equal to the ratio, Vilo?, we can substitute the ratio and obtain 


Vi xi 
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f P "wn 
a i values in the sampling distribution. By multiplica 
oth sides of the inequality, we obtain 
(1 — DVi 
" x 
id values in the sampling distribution. Le 
fa or it will be the upper limit in the interva 
Ts c? in P/2 samples. 
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we the sampling distribution are greater 


tion and division on 
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l estimate. Note that U is less 


square, x3, such that P/2 values of 
than the specified value. It follows 
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E 
f a) @-0 
oy P/2 values in the distribution. Again we can sub: 
on the left and obtain 


stitute the ratio of Vi, to 


Vi Xe 
w 7 m-l!) 


to 


for P/2 values in the sampling distribution. By multiplication and division on 


b ; l 
Oth sides of the inequality, we obtain 
(n — DVz " 
TF > d 
X2 
istribution. We shall call the quantity on the 
mit of the interval estimate. Note that L is 


ote also that L < U because Xj < Xè » 
erval which will fail to bracket o* 1n 


f n units from à population. 


z T. values in the sampling d 
Bin for it will be the lower li 
To than o? in P/2 samples. Note 2 
Ps € two limits, L and U, form an int 
amples from the totality of samples © 


AN EXAMP 
LE 
determined for a 


s Free association times to verbs as stimulus words were 

sa of 30 male adults. The mean of the 30 response times was 1.5 seconcs. 

i e unbiased estimate of the population variance Was 0.04 seconds. m 

ce eStigator proceeded to establish an interval estimate with a level o 

Onfidence of 0.05 by computing the following quantities: 

(29)(0-04) _ 0.072, 
16.0 


(1 — We 
2 


x 
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(n — DY; (29)(0.04) 
go 45.7 


Values of chi square were obtained from Table C, in the Appendix. 


= 0.025. 


AN INTERVAL ESTIMATE OF THE PRODUCT-MOMENT 
COEFFICIENT 


Because a product-moment correlation coefficient is a dimensionless 
number not influenced by certain arbitrary scale differences, psychologists 
frequently use interval estimates of that coefficient. The problem is to obram 
limits for an interval which will be computed from r,y, the sample product 
moment correlation coefficient, and used to estimate Pry», the population 
coefficient. D. . 

When pay, the population correlation, is not zero, the distribution of Fay 
is skewed. Its distribution has been tabulated for sample sizes of 25 or less. 
For larger samples, it has been shown that 

1 l + fey 
Zr =73 log, l — ra 
is approximately normally distributed with variance, approximately 1/(N — 3). 
The problem is to estimate ¢, which is a function of Pay, as indicated by 


1 
P = 5 log a 


We determine 8,, the standard normal deviate appropriate for the level of 
confidence, P. (See the last line of Table A in the Appendix.) Then the limits 
are found in the following manner: Compute the sample correlation 
coefficient, ray. Compute Z, by the transformation formula given above. 


Compute S, = 1/,/N = 3. The lower limit is then given by the expression 
Zin 3,8. 

The upper limit is 
Z, --858,. 


These confidence limits define an interval 
parameter in approximately P of the totality 
It is usually desired to express the limits in 
available which permit one to read values of 
ponding to the upper and lower 1 
also be used for the transformatio: 


which will fail to bracket the 
of estimates. 
terms of correlation. Tables are 
a correlation coefficient corres- 
imits on the scale of Z,. These tables can 
n of r,,to Z,.2 

‘See F, N. David, Tables of the Correlation Coefficient, Biometrika Office, University 
College, London. 


. . id 
*See R. A. Fisher and Frank Yates, Statistical Tables for Biological, Agricultural a" 
Medical Research, 3rd ed. (Edinburgh and London: Oliver and Boyd, 1948). 
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AN EXAMPLE 

A public opinion pollster obtains a sample correlation of 0.60 for 
relation between income and favorableness of attitude toward a certain 
candidate for public office. N, the number of respondents in his sample, is 
103. For a level of confidence equal to 0.05, 8, is 1.96. The value of Z, 
corresponding to r+, = 0.60 is 0.69. S; is 0.1. The lower limit for the interval 
estimate of č is 


the 


0.69 — (0.1)(1.96) = 0.49. 


The upper limit is 


0.69 + (0.1)(1.96) = 0.89. 


Converted to the correlation scale, the confidence limits are 0.455 and 0.710, 


approximately. 


FREQUENCY 
CLASSIFICATIONS 


ET 


Univariate and bivariate frequency distributions have figured prominently 
in the preceding chapters. A univariate distribution expresses the relation 
between frequency and the magnitude of a single variable. A bivariate distri- 
bution expresses the relation between frequency and the magnitudes of values 
for two variables. Another important kind of research data takes the form 
of frequencies in classes or categories which are distinguished qualitatively 
rather than quantitatively. 

The simplest form of frequency classification involves one set of two 
categories. For example, a psychologist may obtain a sample of N respondents 
in a public opinion Survey. As one part of the interview conducted with each 
respondent, a question is asked and an answer of “ Yes” or “ No" is 
obtained. He computes the proportion, p of those answering “ Yes." The 
statistic, p, is an unbiased estimate of the population proportion. 


Interval Estimate of a Proportion 


described here. Tables by Clopper and Pearson (1), Fisher and Yates (3), and 
Mainland (4) are available, 


In these tables, one can locate the sample popii 
tion and read directly the confidence limits for some specified level » 
confidence. The procedure for computing approximate limits will be given 
below. 
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The sample pro i i 
portion, p, is comput 
to eta p puted and then transformed to an angle 
0 = sin"! Jp. 

T ; 
Fée in ê by means of this formula, one obtains the square root of the 
- es and reads from a table of trigonometric functions the value of the 
Pie e whose sine is the computed square root. The variance of the sampling 
- ribution of @ is approximately equal to 821/N. Values of @ are approxi- 

ately normally distributed. The lower limit of the interval estimate of the 


parametric angle is 


fat S 


i» 8, is the standard normal deviate for the desired level of confidence. 
ee the last line of Table A in the Appendix.) The upper limit is 
821 
0 4-8, TW 
inverse sine transformation is 
portion lies between 0.05 and 
de available by Fisher and 
les and for converting 


= oo resulting from use of the 
B ds ered very good when the parametric pro 
ad and when N => 20. Tables have been ma 
k s (3) for converting proportions directly to ang 
ngular limits to limits on the proportion scale. 


Single Set of More Than Two Categories 


i It may be that the psychologist obtains more than two responses to an 
Nterview question. For example, the subject may be asked to respond by 


a f 3 ; 
nswering * Yes," “ No," or “ Undecided.” These responses, 1n effect, 


classify the N subjects into three categories. It is possible to test the signi- 
in terms of the dis- 


ficance of the differences among the three categories 
Mia between the three observed frequencies and the average of the 
hree frequencies. The test of significance involves the j-statistic, chi square, 


Which is computed, as follows. 
The observed frequency, fo» is 
The average frequency, J, is given by 2f, 
and, in the example under consideration, i 
is, of course, also given by N/c. A discrepancy, 
category, Each of these discrepancies is squared. 
€ Squared deviations is computed. The i-statistic, chi square, 
Squared deviations divided by the average frequency. That is, 
2 = p 2f. —f). 
T 


of the three categories. 
Jc, where c is the number of categories 
s equal to 3. The average frequency 
Cfo E 2: computed for each 


Finally, the sum of the 
is the sum of the 


determined for each 


X 
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The null hypothesis is that variation in the observed frequencies is due to 
sampling. ] 

The psychologist chooses a level of significance and consults a table of chi 
square to determine the criterion value of chi square for (c — 1) degrees of 
freedom. (See Table C, in the Appendix.) If the obtained value exceeds the 


criterion value, he rejects the null hypothesis. If it does not, he accepts the 
null hypothesis. 


The Significance of a Difference 
Between Proportions 


When the researcher has obtained two samples from different populations 
and has determined the proportion of subjects in each sample who respond 
in a specified way, he may wish to examine the difference between the two 
proportions and make an inference about the difference between the two 
populations. The procedure for the test of significance is described below. 

A proportion is computed for each sample. Let us use p, for the first sample 
and p, for the second sample. The corresponding angles will be denoted 0; 
and 6. 0, is determined by computing the square root of p, and referring 
the root to a table of trigonometric functions for the angle whose sine is that 
square root. 0, is determined in the same way. The i-statistic used in the test 
of significance is a ratio involving the difference between the two angles and 
the value, /821/N, + 821/N,, where N, and N, are sample sizes. The ratio 
is distributed approximately as ô, the standard normal deviate. The formula 
for ô is 

= 0, — 0, 
/821/N, + 821/N, 
The null hypothesis is that any observed difference between the two 
proportions is due to sampling. 

A level of significance is chosen and the criterion value of ô is obtained 

from a table of standard normal deviates. (See the last line of Table A in the 


Appendix.) If the absolute value of the obtained à exceeds the criterion value, 
the null hypothesis is rejected. 


Joint Frequency Classifications 


It is often of interest to a psychologist to determine how the units of some 
population are jointly classified by two sets of categories. Let the first set of 
categonmes Se Isbeed Ry Jud, LL, LB, ess Ry, tnd thesccond at, Co 
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Ci Cpns Chesne Jointly classifying units of the population requires 

the (rc) subcategories defined by forming all combinations of the type R;C;. 
It is convenient to represent the joint classification, in general form, by a 

rectangular table of r rows and c columns, such as the one in Table 17.1, 


Table 17.1 
CATEGORIES AND SUBCATEGORIES FOR A JOINT CLASSIFICATION 


C Categories 
C C. C; : Cs 5 Co 


R 


Categories ` 


Each cell of the table represents a subcategory formed by combining one 
category from each of the original sets given on the margins of the table. We 
Shall assume for purposes of discussion that an entire population can be 
Classified by the subcategories of Table 17.1. 

The Purpose of investigating the joint classification would be to ascertain 
whether or not there was a relation between the two ways of classifying. No 
relation between the two ways of classifying is taken to mean that the relative 
frequency distributions of all columns are the same and that the relative 
lequency distributions of all rows are the same. When these conditions 
Obtain, We say that the two ways of classifying are independent. We mean 
Simply that the relative frequency distributions of either classification do not 
Vary. In still other words, the differences among column relative frequencies 
9 not vary with rows. 


If we let 

P.r = the population relative frequency for category C;, 

Pi. = the population relative frequency for category R;, and 
Pj; = the population relative frequency for subcategory R;C;, 


then the joint, relative frequency distribution for the population can be 


represented, in general form, by Table 17.2. When the two ways of classifying 
are independent, that is, when there is no relation between them, 


Pik — Pj. 
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When this equality does not hold for all (rc) cells for the population joint 
classification, we say that the two ways of classifying are not independent 
and that there is a relation between them. 


Table 17.2 
THE JOINT RELATIVE FREQUENCY DISTRIBUTION FOR A 
POPULATION 


Relative Frequencies for the C Categories 
Da D.s P.a : Pir A D.c 


pi. 
Relative z 
WE t 
Frequencies 
for the R 
' Dj. 
Categories 
Pr. 


When the population is too large to be classified in its entirety, the 
investigator may obtain a random sample of N units from the population, 
classify the units in the sample into the subcategories of an (r x c) table, and 
make an inference from the sample to the population regarding the relation 
between the two ways of classifying. 

The test of significance which can be applied to the sample frequency data 
requires the computation of chi square, as indicated by the formula: 


om ps : 


where f, is the observed frequency in each subcategory and /,, the so-called 
expected frequency, is computed from the marginal frequencies and the total 
frequency, as indicated by the expression, /f,/N. The summation has (rc) 
terms, consisting of one value from each cell. The number of degrees O 
freedom is (r — 1)(c — 1). 

One can imagine there is a totality of samples of size N which could be 
formed from the population and there is, also, a corresponding totality of 
values of chi square. These chi square values could be thought of as forming 
a sampling distribution, but this is not the distribution employed in the test 
of significance. The observed marginal frequencies for the particular sample 
obtained by the investigator identify a subset of samples and a corresponding 
subset of values of chi square from the totality of such values. The values of 
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Chi square in the subset form the sampling distribution on which the test of 
Significance is based. 


AN EXAMPLE 


The example we shall give would be classified, in our system, as an empirical 
Tesearch study. 

A psychologist is hired to determine whether political affiliation is related 
to Position on a current issue, where the position of an individual subject is 
indicated by a response of “ Yes" or " No” to an interview question. A 
Tandom sample of 150 registered voters is obtained and they are jointly 
Classified as being Democrat or Republican and as having answered “ Yes " 
or “ No ” to the question. The joint classification for the sample is given in 


Table 17.3. The computation of the expected frequency for each cell is shown 
In the table. 


Table 17.3 
JOINT CLASSIFICATION OF VOTERS 
Democrat Republican fi. 

=2 100 
Yes fo i 
No aun s 

f. = (50)(90)/150 = 30 fe = (50)(60)/150 = 20 

Sx N = 150 
f = observed frequency. 


expected frequency. 


The null hypothesis is that any discrepancies between observed and 
SXpected frequencies are due to sampling. 

The i-statistic, chi square, is computed by finding the difference between 
the observed and expected frequency for each cell, squaring that difference, 
dividing the squared difference by the expected frequency, and summing 


these four quotients for the four cells of the 2 x 2 table. The computation is 
Blven below. 


e _ (75-60)? , (25-40? , (15-30? , (35-20)? — 
cw T ws C — x Es cos 
The criterion value of chi square for (r — 1)(c — 1) = 1 degree of freedom 
and the 0.05 level of significance is 3.841. (See Table C, in the Appendix.) 
he null hypothesis is rejected and the investigator announces the finding of 
a relation between political affiliation and position on the issue. x 
Since the chi square function is a continuous curve and the values computed 
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from frequency data are always discrete values, the actual sampling distribu- 
tion is only approximated by the continuous function. For large samples, the 
disparity between the two distributions is negligible. For small samples. the 

disparity can be serious. In 2 x 2 classifications, it is possible to improve the 

approximation by reducing the absolute value of each difference, (f, — fe) 

by 0.5 before computing chi square. The effect of this correction is to reduce 

the size of chi square. If one wishes to avoid the problem of discontinuity. 

one should apply the exact test described by Fisher (2). 

It is interesting to note that 


x = Né? 


where ¢ is the phi coefficient, the product-moment coefficient for two 


dichotomous variables, when each has been coded to or assigned values of 
O and I. 
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APPENDIX 


TABLE A“ 


Values of t exceeded by 0.025 of values in the sampling distribution. 

In testing the significance of a difference between two sample means, the number 
of degrees of freedom is 2(1 — 1) where n is the number of units per sample. 

In testing the significance of a product-moment correlation coefficient, the number 
of degrees of freedom is (N — 2) where N is the number of pairs in the sample. 

In computing an interval estimate of the population mean, the number of degrees 
of freedom is (n — 1) where n is the number of units in the sample. 

When the number of degrees of freedom is indefinitely large, t is equal to 8, the 
Standard normal deviate. Therefore, the value of à exceeded by 0.025 of values in 
the sampling distribution is given in the last line of the table. 

If values of ¢ and ô are given negative signs, then these values exceed 0.025 of 
values in their respective distributions. 


t 
12.706 
4.303 
3.182 
2.776 
2.571 
2.447 
2.365 
2.306 
2.262 
10 2.228 
11 2.201 
12 2.179 


“Table A is abridged from Table II for Fisher and Yates, Srarcsrtca! Tables ie Bitte Mean 
Agricultural and Medical Research, published by Oliver & Boyd Ltd., Edinburgh, by 


permission of the authors and publishers. 
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TABLE A continued 


13 2.160 
14 2.145 
15 2.131 
16 2.120 
17 2.110 
18 2.101 
19 2.093 
20 2.086 
21 2.080 
22 2.074 
23 2.069 
24 2.064 
25 2.060 
26 2.056 
27 2.052 
28 2.048 
29 2.045 
30 2.042 
40 2.021 
60 2.000 

120 1.980 
eo 1.960^ 


. "Note that ¢ = ô, the standard normal deviate, when the number of degrees of freedom 
is indefinitely large. 
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Values of F exceeded by 0.05 of the values in the sampling distribution. 


TABLE B;* 


297 


The table can be used, when the 0.05 level of significance is desired, in testing 
differences between means, correlation, and homogeneity of sample correlations in 


analysis of covariance. 


1 


4 7.71 

5 6.61 

6 5.99 

7 5.59 

8 5.32 

9 5.12 

10 4.96 

11 4.84 

12 4.75 

df 13 4.67 
for 14 4.60 
Variance 15 4.54 
in 16 4.49 
Denominator 17 4.45 
18 4.41 

19 4.38 

20 4.35 

21 4.32 

22 4.30 

23 4.28 

24 4.26 

25 4.24 

30 4.17 

40 4.08 


df for Variance in Numerator 


2 


6.94 
5:79 
5.14 
4.74 
4.46 
4.26 
4.10 
3.98 
3.89 
3.81 
3.74 
3.68 
3.63 
3.59 
3.55 
3.52 
3.49 
3.47 
3.44 
3.42 
3.40 
3/89 
3.32 
3.23 


3 


6.59 
5.41 

4.76 
4.35 
4.07 
3.86 
3.71 

3.59 
3.49 
3.41 

3.34 
3.29 
3.24 
3.20 
3.16 
3.13 
3.10 
3.07 
3.05 
3.03 
3.01 

2.99 
2.92 
2.84 


4 


6.39 
5.19 
4.53 
4.12 
3.84 
3.63 
3.48 
3.36 
3.26 
3.18 
241 

3.06 
3.01 
2.96 
2.93 
2.90 
2.87 
2.84 
2.82 
2.80 
2.78 

2.76 
2.69 
2.61 


5 


6.26 
5.05 
4.39 
3:97 
3.69 
3.48 
3.33 
3.20 
3.11 
3.03 
2.96 
2.90 
2.85 
2.81 
2.77 
2.74 
2.71 
2.68 
2.66 
2.64 
2.62 
2.60 
2.53 
2.45 


6 


6.16 
4.95 
4.28 
3.87 
3.58 
3.37 
3.22 
3.09 
3.00 
2.92 
2.85 
2.79 
2.74 
2.70 
2.66 
2.63 
2.60 
2.57 
2.55 
2.53 
2.51 

2.49 
2.42 
2.34 


1 


6.09 
4.88 
4.21 
3.79 
3.50 
3.29 
3.14 
3.01 
2.91 
2.83 
2.76 
2.71 
2.66 
2.61 

2.58 
2.54 
2.51 

2.49 
2.46 
244 
2.42 
2.40 
2.33 
2.25 


Interpolation may be performed using reciprocals of the degrees of freedom. 


This table is abridged from M. Merrington and C. M. Thompson, “ Tables of Per- 
centage Points of the Inverted Beta Distribution," Biometrika, 33: 73 (1943). It is printed 


here with permission of the Trustees of Biometrika. 
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TABLE B;* 


Values of F exceeded by 0.025 of the values in the sampling distribution. 

If, in testing the homogeneity of two sample variances, the larger variance is 
placed over the smaller, the number of ratios greater than any given value, equal to 
or greater than unity, is doubled. Therefore use of the values tabled below, in 
comparing two sample variances, will provide a 0.05 level of significance. 


df for Larger Variance (numerator) 


4 5 6 7 8 9 10 12 15 20 

4 9.60 9.36 9.20 9.07 8.98 $8.90 8.84 58.75 8.66 8.56 

5 7.39 7.15 6.98 6.85 6.76 6.68 662 6.52 6.43 6.33 

df 6 6.23 5.99 5.82 5.70 5.60 5.52 546 5.37 527 5.7 
for 7 5.52 529 5.2 4.99 490 4.82 4.76 4.67 4.57 447 
Smaller 8 5.05 482 4.65 4.53 443 4.36 4.30 420 4.10 4.00 
Variance 9 4.72 4.48 4.32 420 4.10 4.03 3.96 3.87 3.77 3.67 
(denomi- 10 4.47 4.24 4.07 3.95 3.85 3.78 3.72 3.62 3.52 3.42 
nator) 12 4.12 3.89 3.73 3.61 3.51 3.44 3.37 328 3.18 3.07 
15 3.80 3.58 3.41 329 320 3.12 3.06 2.96 2.86 2.76 

20 3.51 329 3.13 3.01 2.91 2.84 2.77 2.68 2.57 246 


Interpolation may be performed using reciprocals of the degrees of freedom. 


?This table is abridged from M. Merrington and C. M. Thompson, “ Tables of Per- 
centage Points of the Inverted Beta Distribution," Biometrika, 33: 73 (1943). It is published 
here with permission of the Trustees of Biometrika. 


Appendix 299 
TABLE C;* 


Values of chi square exceeded by 0.05 of values in the sampling distribution. 

In using chi square for a test of significance on a single set of c categories, the 
number of degrees of freedom is (c — 1). In using chi square for a test of significance 
on a joint classification of r rows and c columns, the number of degrees of freedom 


is (r — 1)(¢ — 1). 


df x 
1 3.841 
2 5.991 
3 7.815 
4 9.488 
5 11.070 
6 12.592 
* 14.067 
8 15.507 
9 16.919 
10 18.307 
11 19.675 
12 21.026 
13 22.362 
14 23.685 
15 24.996 
16 26.296 


"Table C, is abridged from Table III of R. A. Fisher, Statistical Methods for Research 
Workers, published by Oliver & Boyd Ltd., 1946, Edinburgh, by permission of the author 
and publishers. 


300 


TABLE C;* 


Appendix 


Two chi square values are given below for each number of degrees of freedom. 
The value in the left column, x?, exceeds 0.025 of the values in the sampling distribu- 
tion. The value in the right column, x3, is exceeded by 0.025 of the values in the 


sampling distribution. 


In using these values to establish an interval estimate of the population variance, 
the number of degrees of freedom is (n — 1) where n is the number of measures in 


à random sample. 


“Table C, was abridged from C. M. Thompson, 
Incomplete Beta Function and of the Chi Square Dist 
(1941). It is printed here with permission of the Trus! 


x? 


0.484 
0.831 
1.24 
1.69 
2.18 
2.70 
3.25 
3.82 
4.40 
5.01 
5.63 
6.26 
6.91 
7.56 
8.23 
8.91 
9.59 
10.3 
11.0 
11.7 
12.4 
13.1 
13.8 
14.6 
15.3 
16.0 
16.8 


Xs 
11.1 
12.8 
14.4 
16.0 
17.5 
19.0 
20.5 
21.9 
23.3 
24.7 
26.1 
27.5 
28.8 
30.2 
31.5 
32.9 
34.2 
35.5 
36.8 
38.1 
39.4 
40.6 
41.9 
43.2 
44.5 
45.7 
47.0 


“Tables of Percentage Points of the 
ribution,” Biometrika, 32: pp. 188-189 
tees of Biometrika. 
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Analysis of covariance, 271 
Analysis of variability, 184-200 
and correlation, 198-199 
and description, 199-200 
double classification, 191-197 
single classification, 184-190 
Analysis of variance 
completely randomized design, 258 
matched-group design, 268 


Chi square distribution, 213 
Chi square in 

estimating variance, 284-286 

joint classification, 292-294 

single set of categories, 289-290 
Classifications, 190-191, 290-291 
Coefficient of correlation 

and double classification, 198-199 

computing formula, 142 

definition, 129-130 

for ranks, 155-156 

interval estimate, 286-287 

point estimate, 281 

properties, 141-147 

restrictions on, 156-157 

significance of, 279-280 

special formulas, 148-152 
Coefficient of determination, 145 
Coefficient of nondetermination, 145 
Covariance 

computing formula, 134 

definition, 129 

properties, 130-131 
Curvilinear correlation and prediction, 177- 

179 


d-Statistic, 15 
correlation coefficient, 177, 181-183 
descriptive moments, 117-121, 124-125 


INDEX 


mean, 121 

partial correlation, 179 

prediction constant, 177-178 

variance, 123 
Description 

and analysis of variability, 199-200 

limitations of, 125-127 

with moments, 114-127 

with product moments, 173-183 
Designatum, 3 
Difference between means 

distribution of, 213 

point estimate, 281 

significance of, 258, 268, 274, 279 
Difference between proportions, 290 
Distribution in semantical confirmation, 

100-101 


Empirical research, 246 
Equations, 42-48 
solving of, 43-45 
systems of, 46-48 
Estimating parameters, 251-252 
correlation coefficient, 281, 286-287 
difference between means, 281 
in empirical research, 280-287 
interval estimates, 281-287 
mean, 280, 281-284 
point estimates, 280-281, 288 
proportion, 288-289 
variance, 281, 284-286 
Experimental designs 
completely randomized, 254-262 
matched-group, 262-268 
with adjusting, 269-277 
Experimental research, 246 
Exponents, 31-34 
logarithms as, 33-34 
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Factorial notation, 36 

Fractions, 24-25 

Frequency classifications, 288-294 

Frequency distributions, 80-85 
and descriptive moments, 81-85 
bivariate, 131-134, 149, 151 

Functions, 48-55 
minimizing the value of, 52-53 
of angles, 53-55 


Graphs, 49-55 
Greek alphabet, 26 


Homegeneity of correlations, 271, 276 
Homogeneity of variances, 259, 271, 275 


i-Statistic, 15, 249-252 
between-sample variance, 213-215 
chi square, 284-286, 289-290, 292-293, 294 
correlation coefficient, 217, 220, 270, 280, 
281, 286 
5, 220-221 
difference between means, 213, 279, 281 
F, 218-220, 258, 259, 261, 262, 268, 211, 
275, 276 
mean, 205-209, 211-212, 280, 283 
proportion, 288-289 
t, 215-217, 258, 279, 280 
variance, 212-213, 281, 284 
within-sample variance, 213-215 
Index of kurtosis, 74 
computing formula, 74 
properties of, 79 
Index of skewness, 72-73 
computing formula, 73 
properties of, 78-79 
Inference, 126-127, 244-300 
estimating parameters, 251-252 
in empirical research, 245, 278-300 
in experimental research, 245, 253-277 
testing significance, 249-251 
Interval estimates, 281-287 
correlation coefficient, 286-287 
mean, 281-284 
proportion, 288-289 
variance, 284-286 


Kurtosis (See Index of kurtosis) 


Language 
mathematical, 1-2 
natural, 1 
of empirical science, 1 
psychological statistics as, 7-8 
study of, 2 

Logarithms, 33-34 


Index 


Matrix notation, 166-169 
Mean, 69 
distribution of, 205-209, 211-212 
interval estimate, 281-284 
point estimate, 280 
properties of, 75-77 
Measurement, 96-113 
criteria, 100-106 
distribution, 100-101 
predictiveness, 104-106 
reproducibility, 101-103 
the functional approach, 100-106 
the structural approach, 97-100 
theory, 112-113 
Moments, 68-95 
descriptive, 74-79, 114-127 
index of kurtosis, 74 
index of skewness, 72-73 
the mean, 69 
the variance, 70-71 
Multiple correlation, 170-172 
Multiple prediction, 161-170 


Nonadditive effects 
completely randomized design, 258-259 
design with adjusting, 275-276 
Nonnormality, 228 
and transformations, 242 
consequences, 229-230 
Normal probability function, 210 
Normality 
departures from, 228-230 


Order of arithmetic operations, 17-21 


Partial correlation, 160-161 
Point estimates, 280-281 
correlation coefficient, 281 
difference between means, 281 
mean, 280 
proportion, 288 
variance, 281 
Practical prediction, 180-181 
Pragmatics, 3, 6 
and the analysis of variability, 199-200 
description, 114-127, 173-183 
empirical research, 278-287 
estimating parameters, 251-252 
experimental inference, 253-277 
inference and research, 244-252 
problems in, 12-14 
testing significance, 249-251 
Prediction coefficients, 134-141, 161-170 
Prediction constants, 134-141, 161-170 
Prediction equations, 135, 138, 139, 142, 
169, 177, 178 


Index 


Predictiveness, 104-106, 182 
Product moments, 128-159 
correlation coefficient, 129-130 
covariance, 129 
descriptive, 173-183 
Proportion 
interval estimate, 288-289 
point estimate, 288 


Random interval procedure, 239 
Random ordering, 240-241 
Random sorting, 240 
Ranks 
correlation of, 155-156 
mean of, 92 
substitution of, 92, 155 
variance of, 92 
Reliability, 101-103, 181-182 
Reproducibility, 101-103, 181-182 
Research, 245-249 
empirical, 246, 278-300 
experimental, 246, 253-277 
Roots and radicals, 34-36 


Sampling 
incorrect practice, 241-242 
operations, 230-238 
practice, 226-243 
theory, 201-209 
Sampling distributions, 210-225 
between-sample variance, 213-215 
chi square, 213 
8, 220-221 
difference between means, 213 
F, 218-220 
mean, 205-209, 211-212 
t, 215-217 
tabled values, 221-225 
variance, 212-213 
within-sample variance, 213-215 
Semantics, 3-5 
measurement, 96-113 
practical random sampling, 226-243 
problems in, 9-11 
Sign 
of summation, 56-67 
„unit of language, 2 
Signed numbers, 21-24 
Skewness (See Index of skewness) 
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Standard deviation, 70 
Standard normal deviate, 220-221 
Summation sign, 56-67 
definitions, 56-57 
in equations, 63-65 
limits and indices, 65-67 
simplification rules, 60-61 
Syntactics, 3 
algebra, 25-55 
analysis of variability, 184-200 
arithemetic, 17-25 
moments, 68-95 
partial and multiple correlation, 160-172 
problems in, 8-9 
product moments, 128-159 
sampling distributions, 210-225 
sampling theory, 201-209 
summation sign, 56-67 


Tabled Functions, 221-225 
chi square, 299-300 
8, 296 
F, 297-298 
1, 295-296 
Test construction, 181-182 
Testing significance, 249-251 
empirical research, 278-280 
experimental research, 253-277 
frequency classifications, 289-294 
Transformations 
and nonnormality, 242 
coding, 85-90, 147-152 
deviation from the mean, 69 
special, 90-91, 152-155 
standard score, 71-72 


Validity, 104-106, 182 

Variance 
between-sample, 213-215 
distribution of, 212-213 
of a sample, 212-213 
within-sample, 213-215 

Variance of a set of numbers, 70-71 
computing formula, 71 
properties of, 77-78 

Variance ratio, 259 
distribution of, 218-220 
significance of, 258-259, 268, 271, 274- 

276, 279 
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Another important book 
by William S. Ray 


AN INTRODUCTION TO 
EXPERIMENTAL DESIGN 
by Wnaaaw S. Ray 
The Woman's College of 
the University of North Carolina 


Beginning with simple planning prob- 
lems and a review of elementary statis- 
tics, this text proceeds to a more complex 
statistical approach through an analysis 
of variability. The author stresses de- 
signs important in actual practice. He 
describes the arithmetic of an analysis, 
explains its relevance to an experimental 
situation, and concludes with a presenta- 
tion of the logic and mechanics of an 
evaluation. 
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PSYCHOLOGICAL TESTING 
Second Edition 
by ANNE ANASTASI 


Graduate School, Fordham University 


The latest edition of this text presents a general introduction 
to principles and techniques in psychological testing, including 
a survey of the majer ‘ypes of tests currently available. It gives 
a detailed account of outstanding examples of each type, cover- 
ing intelligence, aptitude, achievement, and personality meas- 
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texts in the field, this book emphasizes the understanding of 
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LEARNING AND BEHAVIOR * 
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The Ohio State University 


Relating the study of learning to the understanding of be- 
havior, this text presents an integrated view of the entire range 
of learning research. The author begins with an analysis of 
behavior in terms of a stimulus-respouse-reinforcement model. 
Within this framework he discusses successively simple habit 
formation, the interaction of simple habits, factors presumed to 
complicate thc simple learning model, and “complex” learning 
cases. Written from the standpoint of experimental, behavior- 
istic psychology, the book cites the most recent experimental 
studies. 
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